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CCATTCCAAA 
GAACAGAGTG 
GGTACAGTAT 
GCTGACAGCT 
CAGATGATGA 
ATAGGAAATT 
CACTGAAAGT 
TGGCAGATTA 
TCTTCATTTA 
GTCCTATGCC 
TTGCTGTGGC 
GCCAAGGAAG 
TGGATGGCCG 
TTGGAGCAAT 
GTAATCAGAG 
CAATGGCAAC 
TTTCGCTCCT 
CTTACCTGCA 
AATACCCTGC 
TCAACAACAC 
AGTTCTGATG 
GCTGGATGAG 
ATGAAGCTCA 
TCCCGGACGG 
CCCTGAGGTC 
AAATGCAAAC 
CTGCTTGCCG 
TGAATTACTT 
TCATGGTTCG 
GTCCTCTTCC 
ACCCTTCAGC 
TTAATAATGT 
TTCATAGGGG 
I I I I IATCTT 
TCGGGGCTTT 
ATGGCACTCA 
TCACTAACAC 



CAAGTCAGGA 
TTCTGAACAT 
ATTATTTACA 
CAAATGGATC 
GAGCAGCAGT 
CAGAAAAGGC 
CAGAAATTCC 
TGCTGATGAA 
ACCTGAGTAA 
ATGGCCTACA 
AATATTATCA 
GAGGGTCTTT 
GGAAAAATTG 
GTCAAGCTAC 
CATTCATGGG 
TACCTCATCA 
TAAAAATTTA 
TGGTGTTTTT 
CCTCTACCTG 
GCTTCCAATG 
TGAACTTCAT 
AACCAGGCCA 
TAGTGATGAC 
CCTATGCAAT 
CTTCCCATCT 
GGTGTCAAAT 
CCCTCTTTGG 
CATGCCTACA 
CCTGGCAGTC 
CAATTCGTAC 
TGGATACGAC 
TCTGGTCATC 
CTTCTTCTGC 
AAACTTGTCA 
AATTTTCCTT 
TTATAATTGA 
AAGGAAAAAT 



AAGCCTGCAC 
CAACACAAAG 
CTGAAGGGGC 
CCATGGAACT 
GGAGAAAGTG 
AGCAATGAGC 
TGACAAATGG 
CACCATCCCG 
TGCCATCATG 
CAGGGGTCAT 
CTGTATTCAG 
GATTTATGAA 
GAGCTTTTGT 
CTCTTTATCA 
ACTTGAAGAA 
TATTTGTGTC 
GGTTATCTTG 
TGTTAGTGTG 
TTTTGGATCA 
CATGTGGTAA 
GATGGATTAC 
AGGGCTCTCT 
AAGTGTGAAC 
TCCTATCCTA 
ACAGTGAACT 
ATTTCCATCA 
TTACCTAACC 
GCAAAGTGTA 
CTTGTGGCAG 
ATCAGTGATC 
ATTTCCTGAT 
CTTGTGCCAA 
CACTATGCTG 
AGAAAGAAAC 
GTGGTTGGAA 
CTGGATTTAT 
AC (SEQ ID 



AGGACTGGAT 
TGGAAGAACC 
TTGTGTGTGG 
GAGAAATGTC 
CTCCAGATAG 
AGTCAATTTG 
Al I I I IGGGG 
GAACCACTTC 
GGCAGTGGGA 
AC I I I I I ATA 
TTCACC I I I I 
AAATTAGGAG 
TTCCATTACA 
TTAAATATGA 
AATACTGGAG 
TGTTGGAATT 
GCTATACCAG 
GTGATTTACA 
CAGTGTTGGA 
TGTTACCCAA 
ACCCACCGCA 
TCATGACAGT 
CCAAATACTT 
GTATTTGCTT 
TAAAGATCGG 
CGGGGATGCT 
TTCTATGGAG 
TACATTAGAC 
TAACACAAAC 
ACACTGTTAT 
TGCAGCTGTG 
CTATAAAATA 
Al I I I IATTC 
TTTTAGGTCA 
TATTCTTCAT 
GATCCTCCAA 
NO:l) 



AAATAATTAA 
TTAAGCTGAA 
ACAAGAAAGC 
AACATCGAAC 
CTACATCAGG 
CTAATGAAGA 
AAAAAGAAGC 
CTTTGGAATG 
TCCTGGGCTT 
ATCATGCTGC 
ATTAAAAACA 
AAAAGGCATT 
ATGCAGAACA 
ACTACCTGAA 
AATGGTACCT 
ATTCTTCCAC 
TGGATTTTCT 
AGAAATTCCA 
AATCTGTCAT 
CAACTCTGAG 
ATCCTGCAGG 
GGAGTAGAAT 
TGTATTCAAC 
TTGTATGCCA 
TCCCGGAGAA 
TGTGATGTAC 
AAGTTGAAGA 
ATCCCTCTTC 
TGTGCCCATT 
TTCCCAAACG 
CTTATTGCAC 
CATCTTCGGA 
TTCCAGCAGT 
CCCCAAAAGG 
GATTGGAAGC 
ATTCCAAGCA 
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FEATURES: 

S'UTR: 1-163 

Start Codon: 164 

Stop Codon: 1805 

3'lTTR: 1808 
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HOMOLOGOUS PROTEINS: 
Top blast Hits: 

CRA | 145000039337444 /altid=gi 
CRA | 114000033649823 /altid=gi 
CRA | 160000003782430 /altid=gi 
CRA | 148000002720069 /altid=gi 
CRA | 87000000006802 /altid=gn| 
CRA | 18000005069115 /altid=gi | 
CRA | 88000001154721 /altid=gi | 
CRA | 66000019404613 /altid=gi | 
CRA | 100000004435450 /altid=gi 
CRA 1 335001098689635 /al ti d=gi 



112017941 /def=gb|AAG45335.1|AF295. 
1 10945621 /def =gb | AAG24618 . 1 1 AF298 . 
1 8677401 /def =gb | AAF75 589.2 1 AF1736 . 
1 8248427 /def=gb | AAF74195 . 1 1 AF2496 . 
7243145 /def=dbj | BAA92620.il (AB03. 
5870893 /def=ref|NP_O06832.1| tran. 
7406950 /def=gb|AAF61849.1|AFl5985. 
9506837 /def=ref|NP_061849.1| amin. 
1 8926332 /def =gb | AAF81797 . 1 1 AF2730 . 
111434147 /def=ref|XP_006635.1| hy. 



EST: 



gi 
gi 
gn 
gi 
gi 



110934204 /dataset=dbest /taxon=96.. 
110286121 /dataset=dbest /taxon=96.. 
19872634 /dataset=dbest /taxon=960.. 
12656674 /dataset=dbest /taxon=9606 
19882497 /dataset=dbest /taxon=960.. 
1689641 /dataset=dbest /taxon=9606 / 



975 
597 
591 
587 
578 
500 
496 
495 
492 
480 



® E 
0.0 
e-169 
e-168 
e-166 
e-164 
e-140 
e-139 
e-139 
e-138 
e-134 



1072 
718 
680 
549 
541 
525 



EXPRESSION INFORMATION FOR MODULATORY USE: 
library source: 

Expression information from BLAST dbEST hits: 

gi 110934204 whole embryo (mainly head) 

gi 110286121 Hepatocellular carcinoma 

gi 19872634 Non-cancerous liver 

gi 12656674 Fetal liver spleen 

gi 19882497 Non cancerous liver 

gi 1 689641 Liver 

Expression information from PCR-based tissue screening panels: 
Mixed tissue (Brain, Heart, Kidney, Lung, Spleen, Testis, Leukocyte) 
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1 IVDPMELRNVN 
51 KFLTNGFLGK 
101 AYTGVILFII 
151 KIGAFVSrTM 
201 LIIFVSVGII 
251 LPVLDHSVGN 
301 Q4KGSLHDSG 
351 PIYSELKDRS 
401 AYSKVYTLDI 
451 IRHFLIAAVL 
501 LVKKETFRSP 



IEPDDESSSG ESAPDSYIRI GNSEKAAMSS QFANEDTESQ 
KKLADYADEH HPGTTSFGMS SFNLSNAIMG SGILGLSYAM 
MLLAVAILSL YSVHLLLKTA KEGGSLIYEK LGEKAFGWPG 
QNIGAMSSYL FIIKYELPEV IRAFMGLEEN TGEWYLNGNY 
LPLSLLKNLG YLGYT5GFSL TCMVFFVSW IYKKFQIPCP 
LSFNNTLPMH WMLPNNSES SDVNFIWDYT HRNPAGLDEN 
VEYEAHSDDK CEPKYFVFNS RTAYAIPILV FAFVCHPEVL 
RRKMQTVSNI STTGMLVMYL LAALFGYLTF YGEVEDELLH 
PLLMVRLAVL VAVTQTVPIV LFPIRTSVTT LLFFKRPFSW 
IALNNVLVIL VPTIKYIFGF IGASSATMLI FILPAVFYLK 
QKVGALIFLV VGIFFMIGSM ALIIIDWIYD PPNSKHH (SEQ ID NO: 2) 




FEATURES: 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATTON 



Q 



N-glycosylation site jjj 



Number of matches: 5 



> 



CD 
C-sJ 



CO 



1 83-86 NLSN UJ „ £ 

2 260-263 NLSF Q _. ^ 

3 264-267 NNTL LU => 

4 276-279 NNSE CC "* nz 

5 369-372 NISI ££{ 

i — 

[2] PDOC00004 PS00004 CAMP_PHOSPHO _JSTTE 

camp- and cGMP-dependent protein kinase phosphorylation site 

503-506 KKET 

[3] PDOC00005 PS00005 PKC_PHOSPHO_3rTE 
Protein kinase C phosphorylation site 

Number of matches: 7 

1 33-35 SEK 

2 49-51 SQK 

3 129-131 TAK 

4 290-292 THR 

5 360-362 SRR 

6 473-475 TTK 

7 506-508 TFR 

[4] PDOC00006 PS00006 CK2_PHOSPHO_SrTE 
Casein kinase II phosphorylation site 

Number of matches: 5 

1 18-21 SSGE 

2 22-25 SAPD 

3 129-132 TAKE 

4 305-308 SLHD 

5 309-312 SGVE 
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[5] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 6 

1 95-100 GLSYAM 

2 153-158 GAFVSI 

3 164-169 GAMSSY 

4 186-191 GLEENT 

5 296-301 GLDENQ 

6 482-487 GASSAT 

[6] PDOC00009 PS00009 AMIDATTON 
Amidation site 

58-61 LGKK 



Membrane spanning structure and domains: 



lix Begin 


End 


Score Certainty 


1 


79 


99 


1.125 


Certai n 


2 


102 


122 


2.503 


Certai n 


3 


153 


173 


1.197 


Certai n 


4 


197 


217 


1.785 


Certai n 


5 


222 


242 


2.123 


Certai n 


6 


332 


352 


1.240 


Certai n 


7 


370 


390 


2.166 


Certain 


8 


414 


434 


1.301 


Certain 


9 


453 


473 


1.520 


Certain 


10 


476 


496 


2.166 


Certain 


11 


515 


535 


2.628 


Certain 
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BLAST Alignment to Top Hit: 

>CRA 1 145000039337444 /al ti d=gi 1 12017941 

/def=gb|AAG45335.1|AF295535_l (AF295535) amino acid 
transport system A3 [Rattus norvegicus] /org=Rattus 
norvegicus /taxon=10116 /dataset=nraa /length=547 
Length = 547 

Score = 975 bits (2492), Expect = 0.0 

Identities = 478/547 (87%), Positives = 508/547 (92%) 

NDPMELRNVNIEPDDESSSGESAFDSYIRIGNSEKAAMSSQFANEDT^ 60 
MDP+ELR+VNI EP ++S S +S Y +GNSEK AM SQFANED ESQKFLTNGFLGK 
IvDPIELRSN^IEPYEDSCSVDSIQSOTGMGNSEKGAIvDSQFANEDAESQKFL 60 

KK LADYADEHH PGTTS FGMSS FN LSNAIMGSGI LGLS YAMAYTGVI L FIIML LAVAI LS L 120 
K L D YADEHH PGTTS FGMS S FN LSNAIMGSGI LG LS YAMA TG++LF+IMLL VAILSL 
KTLTDYADEHH PGTTS FGMS S FNLSNAIMGSGI LGLS YAMANTGIVLFVTML LTVAI LS L 120 

YSVHLLLKTAKEGGSLIYEKLGEKAFGWPGIttGAFVSrT^ 180 
YSVHLLLKTAKEGGSLIYEKLGEKAFGWPGKTGAF+SrTT^^IGAN^ 

YSVHLLLKTAKEGGSLIYEKLGEKAFGWrcKIGAFTSJT^ 180 

IRAFMGLEENTGB/VYLNIGNIYLIIFVSVGIILPLSLLKNLGYLGYTSGFSLTC^FFV 240 
IR FMSLEENrrGEvmJ^GNYL^FVSVGIILPLSLLKNLGYLGYTSGFSLTO^FFVSW 
IRVFMGLEENTGEWYLNGNT/LVLFVSVGIILPLSLLKNLGYLGYTSGFSLTCMVFFVSW 240 

IYKKFQI PCPLPVLDHSVGN LS FNNTLPMHNA/MLPNNSESSDVN FMVDYTHRNPAGLDEN 300 
IYKKFQIPCPLPVLDH+ GNL+FNNTLPMHV+MLPNNSES+ +NFM+DYTHR+P GLDE 
IYKKFQIPCPLPVU>1NNGNLTFNN^ 300 

QAKGS LHDSGVEYEAHSDDKCEPKYFVFNSRTAYAI PI LVFAFVCHPEVL PIYSELKDRS 360 
A G LH SGVEYEAHS DKC+ PKYFVFNSRTAYAI PI L FAFVCH PEVL PIYS ELKDRS 
PAAGPLr^GVEYTAHSGDKCQPKYFVFNSRTAYAI PI LAFAFVCHPEVL PIYS ELKDRS 360 

RRKMC?WSNISITGMLVMYLIJ\ALFGYLTFYGEV 420 
RRKNQTVSNISrTGMLVMYLUVkLFGYL+FYGEVEDELL D LLMVRLAVL 

RRK^WSNISTTGMLN/NrYLUV^LFGYLSF^ 420 

VAVTQTVPIVL FPIRTSVTTLL FPKRPFSWI RH FLIAAVLIALNNVLVT LVPTTKYI FG F 480 
VAVT WPWLFPIRTSVTTLLFP+RPFSWf+HF IAA-H-IALNNVLVT LVPTTKYI FG F 
VAVTLWPWLFPIRTSVTTLLFPRRPFSWVKHFGIAA 480 

IGASSATMLIFILPAVFYLKLVKKETFRSPC^GALIFLW 540 
IGASSATMLI FT L PA FYLKLVKKE RS PQK+GAL+ FLV GI FM+GSMALIIIDWIY+ 



PPN HH 

PPNPDHH 547 (SEQ ID NO : 4) _ 

Q 

iu 

>CRA 1 114000033649823 /al ti d=gi 1 10945621 > 
/def=gb|AAG24618.1|AF298897_JL (AF298897) amino acid £D 
transporter system A [Homo sapiens] /org=Homo sapiens £j 



Query: 


1 


Sbjct: 


1 


Query: 


61 


Sbjct: 


61 


Query: 


121 


Sbjct: 


121 


Query: 


181 


Sbjct: 


181 


Query: 


241 


Sbjct: 


241 


Query: 


301 


Sbjct: 


301 


Query: 


361 


Sbjct: 


361 


Query: 


421 


Sbjct: 


421 


Query: 


481 


Sbjct: 


481 


Query: 


541 


Sbjct: 


541 
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/taxon=9606 /dataset=nraa /length=506 
Length = 506 

Score = 597 bits (1522), Expect = e-169 

Identities = 315/549 (57%), Positives = 383/549 (69%), Gaps = 46/549 (8%) 

MDPMELRNVNIEPDDESSSGESAPD SYIRIGNSEKAAMSSQFANEDTESQKFLTNGF 57 

M E+ +1 PD++SSS S D SY +++AA+ S +Af D E+Q FL 
MKKAEMGRFSISPDEDSSSYSSNSDFNYSY FTKQAALKSHYADVDPENQNFLLESN 56 

LGKKKU\DYADEHHPGTT5F*^SFNLSNAIM^ 117 
LGKXK Y E HPGTTSFGMS FNLSNAI+GSGILGLSYAMA TG+ LFII+L V+I 
LGKKK YETEmPGTT5FGI^VFNLSNAIVGSGILGLSYAMAI^TGIALFIILLTFVSI 113 



SLYSVHLLLKTA EGGSL+YE+LG KAFG GK+ A SrTMQNIGAMSSYLFI+KYEL 



P VI+A +E+ TG WYLNGNYL++ VS+ +ILPLSL +NLGYLGYTSG SL CMVFF+ 



Query: 


1 


Sbjct: 


1 


Query: 


58 


Sbjct: 


57 


Query: 


118 


Sbjct: 


114 


Query: 


178 


Sbjct: 


174 


Query: 


238 


Sbjct: 


234 


Query: 


298 


Sbjct: 


270 


Query: 


358 


Sbjct: 


318 


Query: 


418 


Sbjct: 


378 


Query: 


478 


Sbjct: 


438 


Query: 


538 


Sbjct: 


498 



LU 



DRSRR+M VS IS M -HMYLLAALFGYLTFY VE ELLH YS + DI LL+VRL 



AVL+AVT TVP+V+FPIR+SV LL + FSW RH LI ++A N+LVT VPTI+ I 



FGFIGAS+A+MLIFILP+ FY+KLVKKE +S QK+GAL FL+ G+ M GSMALI++DW 



3 H 

3 GGGH 506 (SEQ ID NO :5) 
ttrrner search results (Pfam): 

Model Desc ri pti on Score E-val ue N 

PF01490 Transmembrane amino acid transporter protein 187.0 2.9e-52 2 

CE00398 E00398 CD53 4.0 4.8 1 

Parsed for domains: 

Model Domain seq-f seq-t hmro-f hmm-t score E-val ue 

CE00398 1/1 90 110 .. 1 23 [. 4.0 4.8 
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PF01490 1/2 99 236 .. 1 179 [. 58.9 2.5e-14 
PF01490 2/2 305 529 .. 200 467 .] 133.9 3e-36 
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1 AGCTTAGCAA TATGGATCAA GAGGTCCAAT ACCTGATTAA TAAAAGTTTC 
51 AGGAGTAAAC AAAGGGGAAG AAATAG I I I I TTTAAATAGT AGAAC I I 1 I I 
101 TTAI I I I IAG AAAATGTGTC TTCTATAGAA GAAAGACAAG CCTTTTGATT 
151 GGGCCGTCTG CATGCTGAGT ATGATGAATT TTAAAAGCGA CTCACATCTA 
201 GTCACGTCGT GATGAAAGGA TAAGGATAAA AATTCTGAAA TCCTCAGAAA 
251 ACCATCGATA AATTATCTAT AAAGAAATAA GAGCCAGACT CATCAATAGA 
301 AGCTAGAAGA GAGAAGTTTC TTCAATATTC TGAAGGAAAA TGCTTCTGAA 
351 TCTAGAATTC AAACAATTAA CAAAGTTTGA AGGCAAAATA AAGAATTTTC 
401 CAACATGAAG CAACTCAGAA ATTCTATTTA CAGACATAGG CTCATTGTGT 
451 GAAAAAAGTT ATTCAAGGCA TTATTTTAGC ATAATGCAAA ATAAACTGAA 
501 GAAAGAAGAT AGAATGCCGT TCAAGAAACT AGCAGCTGAG CAAGACTCAG 
551 AGGTTGGAGG AGGAAGCCAT TCAGAATGAG AAAGAGCATA GAAAATTTGC 
601 TTTCAAAGTT TTGGTAATAT AGAATTATAT TTCACTTATT ATGTAGTCAA 
651 ATACACCACT TTGTCTTTAG GGCATACTAT TTATACAGTG ATAATACTGT 
701 AATTGCTGCT TATTGG I I I I CCATGTTTAG AAACAACCTA CAGGCAAGTT 
751 ATGACACTTG TTTCACAGAA CAAGATGAAA ATATTATGAT TCTCAAATTG 
801 TAAAAGTATT TTATTAACTA AAATAATTAG GAGTGTAGGA GAAGGAAGGA 
851 AAGAAAGAAA AAGTATGCTA ATGTCCTTAT I I I I IATGGG TAACCAGTCT 
901 AAAATCAGTA AACCAAGTCA AAAAAGCTTT AGTGAATTAT TCAGATCTAG 
951 AATGGCTAAC TTTAAGTAAC AAGCTAAAAA CAGAAACCGT CAATAGTGGT 
1001 TGCTGCTGGG AAGTGAGACT GGTACTGTGT GAAGAATGAG GAAAACCTTT 
1051 GTACTCATTT AGTGAGTTTC I I I I I I I I I I CTTTTACCCA TATGCATGTC 
1101 TTACTTCTAT TCTCTCTTAG CTTTTAACCT GCTTCTTTTC ATCTTTTATG 
1151 TATATACATT TAGGCTGCCT TATATTAATA ATAGTTTCAT TTTTGTTCCT 
1201 CCTGCTTAAA ACACTGTGTG CTAI I I I I I I AAATTCTGAG AACTGCTTTC 
1251 TTTATTTCTA GACAATTCTC TGCCATTATC TCTTTCTGTT TTGTCTCACC 
1301 CTAGTCTCAC AATTCTCTAT ATTGGAATGA CTATCAGTGT ATATTTGAAC 
1351 TTGTAATTCT TAI I I 1 1 ICC CCATTCCTCT TAACTTCTTA TTTGTATTTT 
1401 TCI I I I I I IA ATCTCTTCAT GCTATAATTT GAGTGATTTC CACAGATCTG 
1451 TCTTTCAATT TTATAAGTCT TCCTTCAGCT GAG I I I I I I I AAATTTCAAT 
1501 GATTCTATTT I I I ICI I I I I TTTAAGAATT CC I I I I I I I G ACTCI I I I IG 
1551 CAACAGCCTG TTCTCCI I I I ATATTCCTTT ATAATG I I I I TATTCTGTGA 
1601 AAGTTATTCT CTTATTTTGA ATGTTTTCTT TCAAAATGTC I I ICI I I I IA 
1651 TTAATTTAAT GTAAAAGTCC CTTTTAAATT GCTTTGTTAT TTGTAGTTCC 
1701 T TAGAT GTGA ATTTTATCAT TTCTTGTGCC TACTGGCACT CTTGCTAGTG 
1751 AGTTTCCATG TGTGTTCTAT ATGTTTTGTA ATTTGAGGAT GTGAAC I I I I 
1801 CTCAAGTGTG AGTTGCCTTT CAAAAAAGTA CTGCCATGGC ACTGGGTTGT 
1851 GGAGGTATTC CCATGTGGTA GTTTCTGTTT GTCAGAGGAA TAGCACATTT 
1901 TGTGACTTCT GGAGCAATTT TTATGTTAGT TTCTCTGCTC AAGATTTCCT 
1951 TATCAAATGG GTATTGCACA TGTCATGACC ACACTTTTCA AGAATGATAG 
2001 TGTTTCTCCT AATACGATGG TTCAACAATA ATTGAATGAA TCTAATGGTA 
2051 AGAATTTCAG AAGAAATTAT ATCAACTACA TATAGTAGAT TCAAGGCATT 
2101 TTTCAAAAAC ACAATGCCAG TCCACCCCTT TTCACTATAC AATTGAGGAA 
2151 AATGAGGTCC CCAAATGTTA AATGACTTCT GCTGAGATCC AATGAATTAA 
2201 AGGCAGAGCA GAGGCTAAAA TCTAGATCTC TTTGTTGTTA AAATACATTT 
2251 TAATTTGACA CAGATGATGA GTAATGCTGA CCCAGAGGTA AATCTGAACT 
2301 TTCTTTTGTT ACTATTCTTA ACTTTGGCTT CAGGATCCAA GTGCCTAGAA 
2351 AGTTACTTCC TAAACTTGAT CCTCACCTAT GTTGCATATT ATCAAGCATT 
2401 TGGTGGTGTT AATTCTTTCA TGTCCAATTA AATTAAAGCA GTAATTTTCT 
2451 TTCTAGTTAT TGCTAGTAGA GACACTGGTA GATTCTGCCT TGGTAGACCT 
2501 TCCTCTGTCA ACAATTTACT TTTGTCTTCC I I ICI I I IAA AACATGTATC 
2551 CCACTCACAA ATACCTAAAT TTCCTTGAAG ACTGCTGCCA TGI I I IAAGA 
2601 I I ICI I I I I I TTTCCATAGT GACTAGTAAA ACCTGCCATT TTCATTATAC 
2651 ATAGGCACTC TATAAATATC TGCTAATTTA GCAATTATTA GTAATTTCCT 
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2701 
2751 
2801 
2851 
2901 
2951 
3001 
3051 
3101 
3151 
3201 
3251 
3301 
3351 
3401 
3451 
3501 
3551 
3601 
3651 
3701 
3751 
3801 
3851 
3901 
3951 
4001 
4051 
4101 
4151 
4201 
4251 
4301 
4351 
4401 
4451 
4501 
4551 
4601 
4651 
4701 
4751 
4801 
4851 
4901 
4951 
5001 
5051 
5101 
5151 
5201 
5251 
5301 
5351 



TTCTTCTCTT 
GGATTTGCTT 
CAGAGAGCAT 
TAAAATTATA 
GAACATCAAC 
TTTACACTGA 
TGGATCCCAT 
AGCAGTGGAG 
AAAGGCAGCA 
AAAATAAGAC 
TGGATTTATT 
ATCAATTCCC 
CTGATTTAGA 
ATAATTTTAA 
TCTTTGAATG 
TAAAAGTATC 
CCAAATATAT 
GATAATTATA 
GTCTAAATTG 
CTCCTATCTT 
AGTGAAAATC 
TAAACTTTGA 
CTTATTAATG 
TCTTACGTTA 
CTCTCCCTGT 
GCAAGAGCAA 
TAAAGCCCTT 
ACACTGCTGG 
CCTGTCCTGT 
ATTCTGCTCC 
TTGACACCTC 
AGCCTTTTCC 
TTCATGTACC 
TAATGGCCTC 
GTGCACAATA 
TCAAATTGTA 
TAGGCGTGAA 
TTACAACACC 
GTAGCTCAGT 
GATGTAAATT 
GGGTGCTAGA 
AGAGAAGCTG 
AACCAACAGC 
CATTCAATAC 
TTGATTCAAA 
TCTGCCGAGG 
AGTGTCATTC 
CTTTTCTTAA 



(_l I I IATCTC 
CAGTTTAAAG 
AATGTAAATT 
TACTTGTGTT 
CTGGGGACAT 
ACAACTCATC 



CCA I I ICI IC 
ATGTAAAGTT 
ACAAAATGTA 
ATGATGTATG 
ACAAAGTGGA 
AGGGGCTTGT 
GGAACTGAGA 
AAAGTGCTCC 
ATGAGCAGGT 
AGGATGTGGA 
ACAGGTAAAT 
TGGTCCTGAA 
AT TGAGGT TA 
TTATTTTAGC 
TTGCTACACC 
CTCTGGGCAA 
GCCATATACC 
TCTGAATGTC 
CCTCATTTAT 
TCCCTCCAAA 
ACTGCTATCA 
CCCACATAGC 
CTTCAAATAC 
GGCCTACATT 
CTTACATTTT 
TCTCTTACAA 
CTGCACCTCT 
GCACGTAAGG 
I I I I IGCCCT 
AACTGCCTGG 
TCATCCTTCA 
AAATTACTCT 
CTCTCCCTGT 
TGTGGTCCCA 
GTTATTTGAT 
CATGATTTAA 
ATAGAGAATA 
CAGGAAAATC 
GCAATGAACT 
ATAGCATTTT 
AGACAGGGCT 
TTTGTCCTCT 
TGCTTTTCAA 
I I I I I I ATTA 
CCTGAGGATG 
TTGACAGAGT 
CACTTGATCC 
GCTTCATGCA 
CTGGCTTAAA 
AGATGGGTTT 
TTTGAGAAAT 
TTCAATACAT 
TTCTCATCTA 
TCATGCCAAA 



CTTTCTTGTA 
TTCAGGAGTT 
GATGATTCAT 
TTGTGTTCTG 
AGAACCTTAA 
GTGTGGACAA 
AATGTCAACA 
AGATAGCTAC 
ATGGGGTTAA 
CATGGAAAAC 
TTGTGATAAC 
ATACAGTGAT 
CAATGTCTTT 
CTATTGCTCC 
TCCTGGTTTC 
GCTCACTCAC 
CAGCCTGTTA 
TACTGCACGT 
AAAGTTAAAC 
ACCTGCTCCT 
TGTAGCAACT 
CAACGGTCAG 
ACCTACTTTT 
AAATGTGAGA 
GCTCTCCTCT 
TTGCAAATTG 
CCAATAGCCA 
TCCTTTGTGA 
CTCCCTATTT 
AATCAGTCAC 
AGAATCAGCT 
ACTCCCCCAT 
GCAGATGTTA 
CCACCTGCCA 
AAGTCAATTG 
GATGCTCAGA 
TTGTGCTCAA 
CATCAGGAGA 
TTAGGGATGT 
CTAAATTAGG 
TTGGAGTCTG 
TTGAGCTTCA 
GGATGAGATG 
CTTCTCCTTC 
TTTCTGAAAG 
TAAAGGGGAC 
TCAAAAGCCA 
ATGTGTTCCG 
CATAGGACAT 
TATCTATGTG 
TCAACATAAT 
GCTGGGTTTG 
TTTTATACAA 
ATAAGAACAT 



TTGGGTAAAG 
TCTTTCCTTC 
ATTCACTTAT 
TTTGCAGAAC 
GCTGAAGGTA 
GAAAGCGCTG 
TCGAACCAGA 
ATCGGGATAG 
AAATTACTAT 
AGGGTCTTGA 
AATGATATTG 
AATGTCAATC 
GTCTCCATTA 
TCTTATCTTT 
TGTAGGGATT 
A ACTAC TATG 
AGTTTCTCTA 
CTCTACTGGA 
CTGTAATGTC 
CCTCTAGTAA 
CACTCAAAAG 
TCATATCCAG 
CTGTACCCAT 
CAGGGAGAGA 
GTCTAGCCCT 
AATCAATTTC 
TAAGAGAAAG 
TCTGTTCTTG 
GTTACTTGTT 
CTGCTCCCCC 
CAACATCAGG 
GTAGAAGTGA 
ATTACGCCAC 
CATTGTCTGG 
ATTTCCCACA 
AGGGAATTTT 
ACAAAGACTT 
AACTACCGTT 
CGGACTAGAG 
TGACCCTTGA 
CAGAGTAGTT 
ATGGAAAATG 
GGTGACCAGA 
ACTGCATTAC 
GCATGCACAC 
ACCCTCCTAA 
GAGTAGAAAG 
AACCACTCAC 
CATTTTGCAG 
TGGTTTGGAT 
GTATTTATTT 
GTATCAAAAC 
TCTTGGCATG 
GCAAATGCCT 



GAACATTTCA 
CTCCCTTTTA 
TTCATTTAAA 
AGAGTGTTCT 
CAGTATATTA 
ACAGCTCAAA 
TGATGAGAGC 
GAAATTCAGA 
GTTCCATGGA 
TGGGAAGAAC 
A TGCTAG CAC 
TCTTTTGTGA 
ATAATGTGTA 
CTCAGATTCC 
CI I I I CTCTC 
GCCTCACCCT 
CTGAATTTCA 
CCATTACTGT 
TAATACTGAA 
TCCCCATCCT 
CCCCTAGGTG 
TTGGTTTGAC 
TCTACTGTGG 
GCCCTGATTT 
CTACACTCCT 
CATCCTTAGA 
TAGATTACAC 
ACCTGCCCCT 
GCCTTCACTC 
TTTCTCCGTG 
TCTCCTATGC 
CTGCCCCTCC 
TACTACAGGT 
TGCATAGTGA 
AAATGTTATA 
TGACCAAATC 
CTCATTTTAT 
CTTCCTTCAA 
AGGCCACTGA 
AGAAACACTA 
GCCTGACTTT 
TAAAATGGCA 
ATATAGATGA 
CCTCAGTAAA 
AAATATGAGC 
GAACTGTCAT 
AGCATGAATG 
AGTGACTTAC 
I I I I IAAAAT 
TGAACCCTTA 
GTGATCATTA 
ATTTAACATA 
TTAAATGACT 
CAAAGAAAGA 
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5401 
5451 
5501 
5551 
5601 
5651 
5701 
5751 
5801 
5851 
5901 
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6551 
6601 
6651 
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6751 
6801 
6851 
6901 
6951 
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7551 
7601 
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7701 
7751 
7801 
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AAATCTGTTT 
GATTTTAGTT 
GGCACTTTTG 
CTGTCATCTC 
CAGGTACATT 
CAGTGGGTGG 
TGTGCTTTCT 
TGTGTACTCA 
TTTGTGGTTT 
CTCAAAAAGT 
TTAGCTTCTG 
TTTCTATTTT 
GTTTCACTCT 
CTGCAACCTC 
GAGTAGCTGG 
ATTTCTAGTA 
TCCTGACCTC 
ACAGGCGTGA 
TCTTCAATTA 
CTAAAATTTT 
TCGAGCATTT 
TAAAAATGCA 
GTGGCTGGTC 
GAGCTGACCA 
TCCAAGAAAA 
TTTCTTTCCC 
TTCCTTAGTT 
TAATTAAAAC 
ACCACTGAAT 
AAAAAAAAAA 
CTTGATGCCA 
GGCCTGCTGC 
AACATCAGAT 
GTTGAATTAA 
TCTGTCATAA 
AATTAGTTGT 
ATAAACAATT 
TCTTCCATCC 
GAATGACCGT 
CCCTCCTCTG 
GAGCATATAT 
AGTCAATTTG 
Al I I I IGGGG 
AATCTATGCT 
GGTCTCTGTG 
ACATTCTGTG 
TGTTCGCCAT 
GGGAAATCAT 
TCTTTATGTG 
AAACCAAGCA 
GTAATGGCTT 
AATCTGCAGA 
GGCAAATGTT 
CACTTCCTTT 



ACTTTCAAAT 
TATTGA TTAA 
I I I I IATAGT 
AGTCTCCAAC 
AATTACAGAC 
GGGGTGAATG 
GTGAGAAATA 
GAGCTATTTA 
CGTGTGCTGT 
AACCCTGTCA 
TTATAAAAAA 
ATTTCTAAGG 
TGTGGCCCAT 
TGCCTCCCAG 
GATTACAGGC 
GAGACGAAGT 
AAGTTATCCA 
GCCACTGTGC 
TTTTATCTTT 
CAAACCCTCA 
TATATAAAAA 
ATGGCAGCAG 
CTAGGTCCGT 
GCACAGCTCT 
CCAGAATCAC 
TCCCAACTCT 
TCATTCATTA 
ACATACGCAC 
TACTTTCCAG 
AAGAAAGAAA 
GGGGGAGAAT 
TTCACCTTTC 
GAGATAAGGA 
TAAAATCATC 
CCCTGTACTC 
AGTCTCTTGC 
TTTTAACTAA 
CAAAGTATAG 
TAGTGTGAAG 
ATTCAGTTAC 
TTAGATGTTT 
CTAATGAAGA 
AAAAAGAAGC 
TTCAGGCAAT 
GGAAAACAAT 
TTTATTCAGG 
ATGGCTGCAT 
TCTAACCCAA 
ACTCCCCTTG 
CTGCATTCCG 
TACTCCTGTA 
TAATGTGAGA 
TGCTTATAAT 
GGAATGTCTT 



TCTCAATTTT 
AATAAAGATT 
CCTCAGGCCC 
TGTGAACATT 
CATTAATGTA 
GAAATGGAAA 
TGGGGAGAAG 
TGCCTTGAGT 
TTCTATCTAC 
TCCTCTTTCC 
TAGCTTGTAC 
TTTATTTATT 
GCTGGAGTGC 
GTTCAAGCGA 
GCCTGCCACC 
TTCACCATGT 
CCCACCTCAG 
CTGGCCTCTA 
CTTTATCTTT 
ATTTGAAAGC 
CAACTAAAAA 
CAGGCCTTAT 
AGCCTCAAAG 
ATAACCAAGT 
AACGCTCTGT 
TTTGCCAGTT 
TCTGTTAATA 
ACACAGAAAC 
TAAGAGATGT 
ATACCAGCTA 
GGGGTCTCCC 
TGAGCCACAG 
TCACAGTGCC 
TGATTATGGT 
TTTCTTTGGA 
ATAGCTAATG 
AAATAATATT 
TTGTAAATGG 
GAGGAAAAGA 
TTAGATCACC 
TCACTTTCTT 
CACTGAAAGT 
TGGCAGATTA 
AAACGGGACT 
GTGACTGAAA 
CTCTTACTGG 
GAATCATTTA 
AGGTAATCTA 
TAATTTGTAT 
TTGAAAATTA 
ATGTTATCTT 
ATAAGGATAA 
ACATCCCTTC 
CATTTAACCT 



AAAAACTACT 
CCAGAGTTTA 



ATTTTAGGCT 
ATGTACCAGT 
AACCACAAAA 
GAGGCAACAA 
GCTAGGAAAT 
TCTAGAAAAG 
ATCTCATACT 
TCTCCAGATT 
AGATCTCCTA 
TATTTATTGA 
AATGGTGCAA 
TTCTCCTGCT 
ACACTCGGCT 
TGGCCAGGCT 
CCTCCCAAAG 
GGATTATATT 
CTTTTCATGT 
ACTTTTAAAA 
GTCTGTGACA 
TAATTGAGCC 
GCCCTGGCTT 
TGTACATCTT 
GGATAGTGAC 
CATTGAATTG 
ATCCATGTAC 
AACCAACACA 
ATGTATAAAT 
CAGGGCCCTG 
CCTGGGTATG 
TTCCCTATAG 
TAGGCATTTA 
ATGGTAGTAG 
AGGGCTCTAA 
TGAGAAAGAG 
TCCTTCCTTT 
AACTCAAAAT 
AAATTGGGGT 
TGAAACATAC 
ATTTGTGTGT 
CAGAAATTCC 
TGCTGATGAA 
GAGGGTGTCT 
TTTTCCAAGC 
AATAAGGGCT 
TGAAACTTAT 
CAATCATACA 
TTTTACTGAG 
CATGCTTTTA 
AGTCTTCAAT 
CCCCTAAAGG 
I I I I ICAAGC 
GAGTAATGCC 



ATGGAATACA 
AATTCTAGGT 
TCATTTTATC 
CTTCACATAG 
GAGTGGTGGG 
CTGAGGGCAT 
GTTCTTAACT 
CACATACAAC 
GTTTTCTATT 
ATTTTCAGGA 
CAATAATTAT 
GACAGACAGA 
TCTCGGCTCA 
TCAGCCTCCT 
AACTTTTTGT 
GGTCTTGAAC 
TGCTGGGATT 
AATAGAACAA 
AGGAAATGTC 
TCATACATAG 
TTTTGCAGTA 
TCTTGGAAAT 
GTAACTGCAG 
CTAGCCTGTG 
ATCTTAAAGT 
CTTTAATAAT 
ATTTTGAGAG 
ACACACAGCT 
GATTGTACCA 
CCTGGGACTG 
GGTGGGTATG 
GGATATTTTG 
ATAAATATTC 
TTCAGAAAAT 
ATGGGAACAC 
GGAATGTGGT 
ATAACATCCT 
TGTTGGTCTG 
GTCTTATTTC 
ATATGATTCA 
GTGTGTGTTC 
TGACAAATGG 
CACGTAAGTG 
GATCTACCTA 
CTTGATCAGC 
TGI I I I I ICC 
GTGTTTTGGG 
TGTTTTCCCT 
GCCTCTGCTG 
TTGATGTTGA 
TTTGGACTGT 
TATGCCCTTT 
ATCCCGGAAC 
ATCATGGGCA 
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8101 GTGGGATCCT GGGCTTGTCC TATGCCATGG CCAACACAGG GATCATACTT 

8151 TTTATGTAAG TGAATGTATA TGTCTACATT TGGTGATGAA GTCCATGCAT T <b <f 
8201 ACCTGGTGGC I I I I ICAATT AACAATCTCA AGTTTGATCT TTGTGAACGT ^fciAB^ 9 
8251 GAAGACTCAG AGGAGGCTAA TCATGGCACT TGGTCACCCA ACCATCCCTA 
8301 ACCCAACGGC AGAAAGTGTA TGTGCTCAAT CAACCAAAGT GCTGGAGCAG 
8351 CCTCGCCAGA AGAATTTTGT TATTCAGTAA ATACTTGAAA TAATTTGGTG 
8401 TTTAGCAACC AAAAAGATCT TTCCCAGAAG CAAATCTGAT TTTATCTCAT 
8451 TCTTAGGAAA GAAGCAACCA AGCCTAAGAG CCCTGCATGC CCTTGCCTAC 
8501 CTTATGTCCC ATTCCCTGTA CCCCTGTGCG ACAGATACAC TGGGCACAAT 
8551 AGCCTTCTCT CCATCCTATG AAGATGCCAC ATTCCCTCTC ACCATTGGAC 
8601 CTTTGCACAT GGTCTTGGAA CCCTCTTCTC TTCCTTCTTC ATCTAGTTAA 
8651 CTCCTCATAT GTCAGTTCAG TCTCACCTGA ATACTGCGCG CCCTGATCTC 
8701 CATGACTGGG GCAAATCACC TTATCATAAC ACTCACCACA ATTTTAATGT 
8751 TTTAGTGCCA TTTGTCTGAT TCATTTGGTT AATATCTGTC CCTCTTGCTG 
8801 GACTATAAGC TCTAGAAAGT TGAGCCCATG TCTGTTTTTA CTCACCAATG 
8851 TCTCTACCTC CAAACCTAGA GCAGTGCCTG GTACAGGCAA TATTTGTTGA § 
8901 GTGACCAAAC CTTATTCCTA AACCTACGTA CTTTCACCAA ACTTGTTCAA q Rj 
8951 ATGCTGCCTA AGGGTAGCAG CATCTGGTAG TTGACCTGTA GGGTGGATAC m S g 

9001 TGCACTGTCT ATGACAGACA ACAACAGACG TTTATGTGCA TCATGTACAG ^ S $ 

9051 CCTGGCATTT TCCAGGATAT AGTTGGCAGC AGTGGAATTC TTCACAAGAA — o-.cc 
9101 TAAAGTCTGA TGTTAGGCAC CACTGTGGAC ACAGATCCTA ATCCCAAATG LLJ 
9151 CAACGCTAGA GAGTTAAATA ACTGTCTAAG AATGCAACAT TTATATCACA O _ 

9201 AATATGTGCT GTTTATGTTC TGAATATCAC ATATGATTAG TAATCACACA LLl ^ <^> 

9251 GCTATTTGAG GGCTAAGCAT CAGGACTATA AATATTTGTA TTGTGTTAGT CC 5 

9301 GCTTTGATTG AACTCTTTTA TGTATAATAT TCTTCAGCTG AATGGGTTTT 
9351 TATATCAACT TTACTTTTAT ATAAGCCATG TTTTGAAATA AACTAGGATT 
9401 TTAATAATCT GAATTTTAAT AGCTATGTAT GTAGTCATAT ATTTGTATGC 
9451 TTTTGTAATG TGOTACCTC TAAGACAAAA AAACCTGCCT TTCCTTATTA 
9501 ATTATACATA CCATTAAAAT GAATTAGGAA GTTACAGATC ACTGATGAAT 
9551 AGAAATAGGA AAAACTTCCC CCAATCCCAC AGTCATAGAT CATCTTCATG 
9601 AGAGAAGAAT GTTCCACTTT TTAAAATGAG GGCCTCATTT TAGGCTTATA 
9651 AACACTTAGC AGATGAATTT GGTCAGAACA ATTAAATCAC TAAACATCAT 
9701 GGGGTGTGTT TTGTGTGTCT AAGTAGCCCA GACTGGATTA AGCTTTCTCT 
9751 CTTAATTTAT AGCAAGTGAC ACAGTATTTT AAAGGTTTTA CTCTTAGTAT 
9801 TTTCTGCCAG AGAAAGTACA TGTTTAGAAT ACAGGGAATG CTCATTATTT 
9851 TTCCAGGGAA CAAAATTATA TAATCTGAAT TACATTATTC CTTAAAAACA 
9901 GTTAAGTTCA TAAGGCATAT GGAAAAATAT AGGAATAAGT CATTGGTTAG 
9951 ACAGTTCTGG CAAACATACT CTATGGAAAA TAAGAGTGCA ACATAGCTAC 
10001 AGGGGTTATA AAATTTATAA TTCATGGTCC AAATGTACAT TTGTAGTATT 
10051 GATTTCATTG GGAATTACCA AGGGATTAGA TCAATTGTGG GGAAAGTGTA 
10101 I I I I I IAAAA ATAAACAAAG ATAAAGATTT I I I I ICTGAA TTCCAGGTAA 
10151 AAGGCAGCAT TGCTCCTCCA TTTATTACGT AGATGCTTCT ATCAACATTC 
10201 TTAI I I I IGT GCTCCAAATC TTGGATTTGG AAAAATACCA ATCCGTATAA 
10251 ACATAAAGAA ACCATACATG CATGTGGGGA TCCTAACACC AGAAATGACT 
10301 CTGAATGCAA AAAAAAAAAA AAAAAAAAAA GGGAATTTTC GTGCCCCATC 
10351 CTTAGCTTTC TCTGCTTTCT CTATTATATA TGCAACTGCC TGCCCCTCTA 
10401 TCTTACAAAG TACTTCGTAA TCTAATGCAC AGGATCAGCA GTAATGCAGC 
10451 TCAGACTGCA TGCTTTCGCC TTTGGATTCC TAGATTTCAG ATTAAGGTTT 
10501 AGTCAGGCTA TTGAATAGCC CTTCAATTCT AAGTGCTGAT GTGAATATCA 
10551 TGCAAATATG ATGTACATAT TCCCATGTGC TGAGTAAGTA GATGTAGCAT 
10601 TTGCTAATGT TGCTATACAT TTAGCATCTA AGTTATGAAC CAGATTCTAC 
10651 CACTGGGTAA CATTAAAAAA AAGTTAGGGA CTTCAGGTAT GTAAAATATA 
10701 GCAAATTCTA TTTCTACGAC TTTAAAGGGT ATGTGTAGAG TTCTGAAAAG 
10751 AATTTCTCAG CCTCCCCCAA ATCCACATAC TTTTGGAAAG CTGATGATTG 
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10801 AAAAGATTAA TGTGATCCTT TATTGTAACA TCTAACATAA TTACATTTTA 
10851 TTTATTGTAG AAACTTTATT ACCTACTCTC TCTTCCCTTT GCAGAATCAT 
10901 GCTGCTTGCT GTGGCAATAT TATCACTGTA TTCAGTTCAC CTTTTATTAA 
10951 AAACAGCCAA GGAAGGAGGT ATGCTACCAC TTGAGTCCAA CACATTCTAT 
11001 TTTAATTCTC ATAAAAGAGT ATTTCAGTCT GTTGCTTCAT AACCTTAGGA 
11051 TGATTATAGT CAGTTTCACA TTTCATTTTC TTCTGAGCCC AGTGACACGA 
11101 TCTCTCAGTG TTTATAGTTG TTTGGGCAAG TGAGAGGCAG GAGTGAAAGT 
11151 CAACTGGCrC AGGTTCAAGA CAAATAGAAA AAAGAAATTT CTGATATATG 
11201 ATAGAAATAA CTGTTTTGAC TTGCTACATG CAGCTAAAAT AAATAAAACC 
11251 ATTGATTCTT GTTTGGAGAA CATTTTGATA TATTGCTTAT TGGTTTTTGA 
11301 GGTTGCATCT TTTGGGCTTA TAATTTCTAT ATGATGTTTA TTTACATGTT 
11351 TGAGACTCCA GCATGGAATT ATATGACAAA AATATTTTAG TCATTAAAAC 
11401 AATCTCTTTA ACAAGGCTAT TTTATCTTTG ATTGTAGGGT CTTTGATTTA 
11451 TGAAAAATTA GGAGAAAAGG CATTTGGATG GCCGGGAAAA ATTGGAGCTT 
11501 TTGTTTCCAT TACAATGCAG AACATTGGAG GTAAGGGGAT ATACTTTCCA 
11551 ATGGATCCCA TAAACTTTCT ATAGCGTGTT CAATAAATAA GAAAACTTAT 
11601 GGCAATAAAC AGGCACTTTA GATACAGAAA AATTGCTACT TATAGTTCTT 
11651 AAATTTTAAA ATGATAGTTT CTTAAATAGG TTTGTGTCCT GCTTTAATTA 
11701 AAAACAGCAA TATCTAAGAA TGAAATAACA TATAAAACCC TGCCAATTGA 
11751 ATTCTAGAAT TAAAATATAA AATAAAAGCT TTCTTGATTT TTAATGTTAT 
11801 TATAGCATGA ATTATTACTC TTAAAAATTG AAGAATTTGT GCTTATATCT 
11851 GTCATTGACA AAACAGTTGA CGTTTTCTAT GTGTGACTGA GTTCGATTTA 
11901 CTAAACTGAA AAGTGGGTGT CTGGGGGAAC ATAGCCAAAT GCTGTGGTCC 
11951 TTGAAACGCA GCCTGCACTG AGCCAGCCCA CTAGACAGTG TCTCTGGAAG 
12001 TTTACTAAGG CAAAAGTCTG GCTAGGCATC AAATGCACTA TAAACCCCGG 
12051 TTTGTTGATT CTATGGATTC TTATAATTCC CACTGAATTA TCATTTCCAG 
12101 TGTAGGACCT AGAAATATAT ATATATATTT TTAACAATGT TCTCTCGTTG 
12151 GTGTGTTTGC CCACCAGCTT CATACTGTTT CTGTTGTGTC TTTGGCCCTC 
12201 AGAAGGCATC CAAACCCATA TTTCAGATGT CCTGCCGGCT GCTTCCTGGC 
12251 ACATGGCCCC AGCCATCTCC CCACATAATG ACACTTACTC CCTCACCTCC 
12301 TACCCAGTCC CTAAACCTGC TATTCTATTT CTCTGATCTT TCI I I ICTCA 
12351 GTGAATACCA CCAGCAGTCA TCCAGTTTCT GAGGGCAGAA ATCTGGATGT 
12401 CAGCGTAAAT GTTTCLI I I I CCCCAACTCT GCATGTCCAA TCAAATGGCA 
12451 AAGTCTGTTC ATTTGATCTC TTACTTATCT CTTGAACCTC TCCTCTCTGT 
12501 CCGTCCTCAT GACCACAGAT GATCACCATT TATAGCTCAG ACTATTGCAG 
12551 TAGTCTTCTA ACTGGTCTTC CTGGCTTGAG TTTCCCCTGC TCTCAGATAA 
12601 ACTCTA ATTT GTTCTCCAGA TAAACTTTCT CAAATTTGAG TCTGTTTCTA 
12651 CI I I IGTCGT GCATAAAATT CTTCAGCATG CCTTTATTAT TTTCAAGGAA 
12701 AAACTTAAAC TCATTGGACT GACACAAGAT CTTCGTCTAG TTCTTCTGCT 
12751 CAATCTTTCT AAACTTTCCT AGCAATGCCC ATATCTATCT ATCTTTATCT 
12801 ATCTATCTAT CTATCTATCT ATCTATCTAT CTATCTATCT ATCATCTATC 
12851 AATTTATCCA TCATCTATAC CCTACATGTC CTGTGTCAAA CCATAACAAA 
12901 TTATATTTAT TCCCCTAACA GTACTATTTT AATAI I I I IA AAAATCATCC 
12951 ATGCCI ICI I TTCACAGGCT ACTTTCTCCC CTTGACTGTC TCTCAAAGTC 
13001 CTCCAACCCT AACACACACG CACACACACA CACACACACA CACACACACA 
13051 CACACACATT TTCTCTCTCA CTCTGCTCAC CTGGTCTATT GCTCCTCTAG 
13101 ACTGGTAAAT ACTAGTTCCT CTGGGCTCTC ATGGTCCTGT TTGTATCTAG 
13151 TATGTTACTG TTTTCTAAAG GATATTTTAA AACACTTGAG TAGAGAATAA 
13201 GCI I I IGGAG TCTGATGGAC CTGAATTTGA GTCTGTTTCT GTCACTATCT 
13251 GTGAACTTGG GAAGATCACT GTACTCCTTT GTCTGATTTT TTCATGTATA 
13301 AAAATTACCT TACAAAGGCT ATTGTGAGGA TGAAATAAGG TAACATATGG 
13351 CACATAATAA GTGTTCTGTA TATGCTTCTC TCCTCCCTGG TTCTCTGCTT 
13401 CCATATCCAT GTCTCTGGAG TTGCCTGAAT TAI I I I I IAA ATAGGCATTT 
13451 AAAAAATTAT AAAACAAATA TATGATGATT GTGAAAAACT AAAACACTGC 
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13501 
13551 
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13651 
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13751 
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ATAAATATAT 
AACTACTCAT 
TTGTTAGTAT 
ATTTAACAAT 



TTTGAGGTTT 
ATTGATGAAC 
ACAGTAAATA 
TAAATAAATT 
TTAGTAATCT 
AGATAAGTAA 
ATATGTTAAG 
GTCTCATTCA 
TCCCAACAAA 
AGGTTTAAAG 
TTAACAACTG 
AGTTGAGTGG 
CTTCTAAACA 
ACAGGTAAGA 
ACCCCAGAAA 
TTTGGCATAA 
TAGAACACTT 
TGI I I I CACA 
CCAAACATGT 
GTAACATTGA 
TGTTCACACT 
ACCAAATAAT 
CTTTGACAGT 
TCCTTTTCAA 
TTTTATATCT 
TCACAGGGTT 
TGTGGGGAAT 
ATTTTTAAAC 
GCCTGTTAGC 
CATGTAAAAT 
CAACAAATTA 
GATGCATAAA 
AATACATATT 
TCTTTCACAA 
TGAAATTTTG 
GATGAAGAGG 
TGTCATCTCT 
ATAAGTGGTC 
GATTGGTGCC 
TCTGCCCTTT 
AGACTACTGC 
TATGTTGATT 
TGTAAACTTT 
CAAAGGTCCA 
AACTTGGAAG 
ATTAGATGTT 
ACTTCAGTAC 
TGTCAAGCTA 
GCATTCATGG 
TGACATCAAG 



AAATTACCAA 
AGGTTTTCCC 
TGGATCATCT 
ATGCCTTGAA 
CCATTACACA 
AGTTGGATTG 
CACTTGCACA 
CACTGGAGTT 
ATCTAACTAT 
AATTCAGCCA 
ACTAC AATCT 
TTTTACTGGT 
TAGACCACTT 
TATCGAGCAT 
AAATAACCAG 
CAGCACCTCA 
ATGGAATAAA 
GGCGTGACTT 
TAATGCTCTC 
GAGAAATCAC 
AATTTTCTCA 
TTAACAGATT 
TAACATCACT 
ATGGTACCTC 
CTGTAAGATT 
GTGTACAGAA 
ACAGAAGAAG 
GCTGTATGAT 
TCATCAGTGG 
TTTCAGATGA 
TATTATCATC 
TCAGTAGTTT 
AAAAATTCAC 
TTGTGAGTTC 
ACGTTATCCT 
TATTTACAGT 
CAI I I I I ICA 
GAGTAGATAA 
CCACTTTATA 
GGGGAAAAAA 
TGTATTCTTT 
TGAGACTTTG 
CATCTGCAGA 
GATTCTGCCA 
CCAAGTTATA 
TATAACTAAT 
GGAGCAGGAT 
CCTGAAAGAG 
CAGTGCCAAA 
CATCTTACAT 
TGTCCACACC 
CCTCTTTATC 
GACTTGAAGA 
TGACTCATTC 



GAAAAGTTTA 
CTATGCCTAA 
TATGATACCG 
TATATTTGCA 
AATGTGCCAT 
TTTCTAATTT 
GAGATCTTGC 
ATCAAGGATT 
ACTCACCCTG 
TATCTTATGA 
TGGGTAGAAT 
TTGGAGCCAT 
TACATTTGTT 
TTCATAGGAT 
GCATAAGCAT 
GCTGTGGTTC 
GTATAAAGCG 
TAAGGGAGTA 
ACTTTGGTTA 
TTGTATTTCT 
ATCCCCTAAA 
AATGTCTGGA 
GACATCACTG 
ACCAAGCCAG 
TAGCTTTAGC 
GGTACATCCG 
CACCAGCCCT 
TGTGGACATG 
AATGAGAATA 
TCAAATGAAG 
ACTGTTACTT 
AATTTACAAT 
TAGAAGGATG 
TAI I I I IAAC 
TCACCAAATG 
TGTAGGCAAA 
GGGAGAAGGC 
TCATTCATGT 
TAATTCCTTA 
AAAACTTCTT 
TATCAATCCA 
TGTACTCCTC 
AAACCAGAGG 
TCTCTCAGCT 
GACACTAACA 
GAGGGCAGAA 
TTAACACAGA 
CTGGAAGGTC 
TACACAATGA 
ATCCCAAATG 
ATTAAGCTGT 
ATTAAATATG 
AAATACTGGG 
TACTTGGTCT 



TGTCAGTCAT 

TTCAACAAAT 

ATTTTCAGCT 

TGTTATTCTT 

AATTTGTTTA 

TTCACTGTTA 

AAACAGGCAA 

TCTGGAATGC 

ATAATGGATA 

TTTGTGTTAA 

TTGACAGTAA 

ATGCATATTA 

TCAAACTCTC 

TGCCTTATAG 

AATTAACCCT 

AAAGCATAGC 

GTCTCTCAGT 

AGATGAAATA 

CTTTATTTGA 

CTATTTAACA 

AAATTAACAT 

TCATTCTGAA 

AAAACCAGCA 

CTAATCAGAA 

CAAGGTCTTT 

CTATTGTAAA 

TCTGTTTTAG 

TCAACTTAAC 

ACAACATATA 

TAATGTGCAG 

TCATATGAAG 

TTAAGTATGT 

TAGGACACAC 

TGAATCI I I I 

GGTGGGCTTG 

ATTGTAATGT 

TTGTAGATTT 

ATCACTTACC 

GTTAGCCAAA 

TGACAAAGAT 

GGAAGCCTTT 

AGATAGGTCC 

GGATATATTG 

GGCCCATGCC 

CAGGCACACT 

CCTTAGAACT 

ATCAGCCCTG 

AAATGTCTAT 

LI I I II I I IC 

TCATAACTTG 

CACATTTTCC 

AACTACCTGA 

TATGTCTTAT 

TTTCTGATTC 



CCTCAGAAAT 
ACATTGAATA 
TTCTTTTTAA 
TTTAATGATT 
CAGTATCCTT 
TAAAAATGCT 
CCCATTTTAA 
AGAAATTTCT 
GTTGGTAAGC 



^ 

n 



AAAAAI I I I I 
TATCAAAATT 
GCCCCCCAAA 
AGCCTTATCA 
TTGGTCTAAT 
GGACTCAAGA 
CACTACTACG 
CAAGCCTCAC 
TCGTAACATC 
TTAGTTGATA 
ACTCTACATT 
TTACTGCAGA 
I I I I IGAAGA 
ATTAATAGCT 
ATATCTCCTG 
GCAAAGATTA 
AATCATTTCA 
ATGTAGTCCG 
ATCTCGGAGT 
TCTTGTCATC 
AACTAACCAA 
TGAAGAAAAT 
GTTTTAAAGT 
TTAAAG I I I I 
GGCCATGTGT 
AAAAAGGCGT 
TATGTATATG 
CATCAAGAAA 
TAGATGCTCA 
AGGAGAGTAA 
GGAGAGAAGC 
GGTTTTGACA 
CGGAGGACTA 
ACTCTGCAGA 
TTTTGTTGCC 
GAGTATGGGC 
GCAGCTTCAC 
ATACTGTTAA 
CTTGGAAGAG 
CATTTGGGGG 
CTTGCATGTG 
ATTTTAGCAA 
AGTAATCAGA 
GCTCCCTCTG 
TAATATCCCT 



LU 
> 

LU 

O 
LU 



cvj CD 



FIGURE 3F 



Docket No.: CL001010 
Serial No.: 09/776,705 
Inventor: Karl GUEGLER et al. 
Title: ISOLATED HUMAN TRANSPORTER PROTEINS.. 



16201 
16251 
16301 
16351 
16401 
16451 
16501 
16551 
16601 
16651 
16701 
16751 
16801 
16851 
16901 
16951 
17001 
17051 
17101 
17151 
17201 
17251 
17301 
17351 
17401 
17451 
17501 
17551 
17601 
17651 
17701 
17751 
17801 
17851 
17901 
17951 
18001 
18051 
18101 
18151 
18201 
18251 
18301 
18351 
18401 
18451 
18501 
18551 
18601 
18651 
18701 
18751 
18801 
18851 



GTCTCTCACT 
TTGTGTCTGT 
AAAGATATTT 
AGGTTAGCTA 
ACTTCTAGGT 
TGI I I I I IGT 
GGTTAGCATG 
GCACTTCACr 
GACTAATTTA 
TAGAATATCA 
TATGGGGAAA 
TTTAGAACAG 
ATATTCACAT 
AGGACAAGAA 
ACAAAAGAGG 
ATTTCCCCAA 
TCACCTTAGT 
CATGAAAGAG 
AAGAGGCAGG 
CTAGAGGAAG 
TTCCCTCA CC 
GTTTTTTCTC 
GGTGATTTAC 
ACAGTGTTGG 
ATGTTACCCA 
CACCCACCGC 
TTCATGACAG 
CCCAAATACT 
AATGAGTACA 
GCCTGAGATC 
GTTTTTCAGA 
CCACCCTGAG 
ATCATTTTAG 
ATTCTTTAAC 
TACTCCATCC 
GCCCACCACA 
ATGGTTCAGG 
AATAATTTTG 
AATGATGTCA 
ATGGTTAGGG 
CTAACCCTTG 
CLI I IGI IGT 
TOTCCAGCT 
TGTTGATATT 
CCTCTTTCCT 
TCTAGACATC 
CCCCATAAAT 
ATTTTGATGT 
GAAGCCTGAC 
CAGAAGCTTT 
CTACTAGGTT 
GCTTTCTATG 
TCAGTGGGCA 
GGAAATGATT 



TCTAGAGAAT 
TGGAATTATT 
TCTAACTGGA 
ATTGTAGATG 
TATCTTGGCT 
TAGTGTGGTA 
AGTTTTTTTG 
AATATGAAAT 
GAGCAAAAAA 
AAGCTACTAC 
AAAGGAAAAC 
CAGAGGAGAG 
GAATCAGCCA 
CAGTGGGATT 
GAATTTCTTG 
AGAGGTAAAG 
TTGAAGAAAT 
GGTGGGAAGG 
GCTGCCAATC 
TTTACTTTGT 
CCCCACCCTA 
AAAAAGCTGA 
AAGAAATTCC 
AAATCTGTCA 
ACAACTCTGA 
AATCCTGCAG 
TGGAGTAGAA 
TTGTATTCAA 
GTTATGTGTT 
ACCATCTATG 
CGGCCTATGC 
GTCCTTCCCA 
CATTCTAATT 
AGGAAACACA 
ACTCCTCAGT 
GCACTACCTT 
GGAGGGGTTC 
GCAGGTGTTG 
GCTGGACTAG 
GCAGCCTCGA 
TCTAGAACAC 
CTTATTTGTA 
GATGGTCTAT 
TGCTTGGGAA 
TTAAGGGAGG 
AGTAACCTCA 
TCTGATTTAC 
CTTTGATTTA 
AATTGATCTT 
TCAGGTAGTC 
CTTTTCCTCA 
TTAGGAGCCT 
AGCTTACTGA 
GTTTTTAAGA 



GGTACCTCAA 
CTTCCACTTT 
AATAI I I I IA 
CCATATTCAC 
ATACCAGTGG 
AGTGATGTGA 
TGCCTAAATT 
AGTTCTTGTA 
AGAGCAGCTA 
TAATGGTTTG 
TTCAAGCAGG 
TCCCAGATGA 
TTCTCTCTTA 
AAAGAAGAAA 
CACTATCATG 
GAGAAGGAAA 
AAGGAAATGA 
ATTCTTGCAA 
ACAGCTCCAA 
CCATTAGTCT 
TAAGCCACAC 
I I I I I I I I IC 
AAATACCCTG 
TTCAACAACA 
GAGTTCTGAT 
GGCTGGATGA 
TATGAAGCTC 
CTCCCGGGTA 
TTCTAAGTTT 
TTGGAATGCT 
AATTCCTATC 
TCTACAGTGA 
TGCTTTGAAA 
GTTTATAGCT 
AACATGCTTT 
ATCTTCTTTC 
ATGGCAGGTG 
GGAATTTAAA 
AAATGAAAAC 
TAAGCCAGTG 
ATTCATTACA 
CAATAGAGTC 
GGTTCATTTG 
CAAAGGATAT 
ATATTATATA 
GTCTTCATTC 
CTCATAAAAA 
CTTTGAAAAT 
TGGCAGCCAG 
TGGACTCCTG 
TTTAAAGTCA 
GTTTCATCTT 
CCTACGTGAA 
CTGAAGGACT 



TGGCAACTAC 
CGCTCCTTAA 
I I I I I ATTTC 
CTTCCA AAAT 
ATTTTCTCTT 
TGACATGATC 
AGTGTCCTCA 
TCACAAGTGA 
CGATTTAAAG 
GTCTAGGCAC 
AACATGACAA 
GAAACAAGAA 
CACATTCCAC 
TCCTCCTCTC 
AATGCCAAAA 
AAAAGTTTTG 
TCATCTTTCT 
AATATTGTCC 
CTCTTCCCTT 
AAAAGGAATC 
ATATGGATTC 
I I I I I IAATG 
CCCTCTACCT 
CGCTTCCAAT 
GTGAACTTCA 
GAACCAGGCC 
ATAGTGATGA 
AGTGAGCGGT 
TTATTCAATA 
AAACACGTGG 
CTAGTATTTG 
ACTTAAAGAG 
TTCTGCTCAT 
TCCTCTTCAG 
AATCAGAAAG 
TCTCCTTTCT 
GACAAGGAGT 
TTTGAATTTT 
ACCCATGACG 
ATGTCATTTA 
AGAGATGTGT 
ACTGGCTAGA 
TATTLI I I IC 
GAACTCATTA 
ATAATTCTCA 
TCACTAAATA 
ATTTCAGAAC 
TACATGTAGC 
GTTCCTTCTA 
GCAGTAGTAC 
TCTCATTATG 
TATGTTAATT 
ATAGACTGTT 
AGTGTTTAAG 



CTCATCATAT 
AAATTTAGGT 
ACATTTAAAT 
GCTTCTTCTA 
ACCTGCATGG 
CTTGCAGGTT 
TTTTGTTCAA 
I I I I C_ I IGTA 
ATAGTTGAGG 
ACTGGTTATA 
TAATCTGGCA 
GGCTATATCC 
CCATTAAGAG 
TAGGCCCCTG 
TTTATAAAGC 
AAGACCCATG 
CATGGAAGGG 
TGTTAACTCT 
AGAACAGAGG 
CCTAACTGAG 
TTATTTCATT 
ACTGAGTCTA 
GTTTTGGATC 
GCATGTGGTA 
TGATGGATTA 
AAGGGCTCTC 
CAAGTGTGAA 
CCGGGCTTCT 
AACTGAGAJG 
TGTTGTCnT 
CI I I IGTATG 
TAAGGCAGCC 
ATGTTCAAAG 
AGAAAATATG 
GTGGGAATCA 
CTCCACCATA 
CGATGGTTGT 
GTTCGGAAGA 
ACCAAAACTT 
TAGTCAGCAC 
CAATATCTGT 
AAATCTTGTT 
CCTTTGAAGT 
TAGCTG I I I I 
ACTTCTTTAA 
GCAAAACTTT 
ACTTTCAAGT 
AGTTACTCCA 
GAATGG I I I I 
TTTGCTGACT 
AAATGCAAAA 
ATATTCTTAT 
CCTCTTCTAG 
AAAAATGGAA 
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18901 
18951 
19001 
19051 
19101 
19151 
19201 
19251 
19301 
19351 
19401 
19451 
19501 
19551 
19601 
19651 
19701 
19751 
19801 
19851 
19901 
19951 
20001 
20051 
20101 
20151 
20201 
20251 
20301 
20351 
20401 
20451 
20501 
20551 
20601 
20651 
20701 
20751 
20801 
20851 
20901 
20951 
21001 
21051 
21101 
21151 
21201 
21251 
21301 
21351 
21401 
21451 
21501 
21551 



ATGAATCCTC 
TGTACTAAAT 
TTATTTTCAT 
CTTGGTGTTC 
GTCAAATATT 
TCTTTGGTTA 
ATATGCAAAT 
TAAAAATTGC 
ATATACTATT 
AGCCTGGCTA 
AAAATATCCC 
TCTGGCTAGA 
CCACTTGCAT 
AATAGCAGGG 
CCAGTCAAGT 
TATCCCATTC 
GACAGAATTT 
GGCCACATTA 
TCACAAAGGG 
CAGATTTCAT 
CAGTGTATCA 
TAACTTAATT 
AAATTGTGGA 
I I I I CTATTT 
GGGCATTAAC 
TACTGAAGGT 
ACTACGCTTG 
GAATGAAGGT 
CACTGGGAGA 
ATACTTATTT 
AAAAATATCT 
AAAATGGTAG 
TGCATCAGGA 
AGAGACAATA 
TTGGTCTCTG 
TCACAATGTA 
GCAACGTTAT 
TAGCTAATGA 
AGGTCTTTTC 
ACCTGTGCTG 
GTAGGAGAAG 
ATTAGACATC 
CACTAACTGT 
GAAAGAAACC 
TTTGATTTAA 
TTTTCCTATA 
TAGAGCAAGT 
AAAACTTATT 
TTGTGGATTA 
AAGCCAAAAC 
TGAAGGCATT 
AAGTTGCCTT 
ATGTTAAGTA 
CTATCTATCA 
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ATTAGCTCTC TAAGACAAAT TTAAATCAGC TATAAGTT 
ATGTCTTCAT GATTAGCAAT ATAGATATAC 
TTTGAAAAGT GAI I I I I I I I TGTAAGTTTA 
TTTCTTTTTC CAGTCGGTCC CGGAGAAAAA 
TCCATCACGG GGATGCTTGT CATGTACCTG 
CCTAACCTTC TATGGTAGGT CACTCTGAAA 
CCTTGTTAGG CTGGTCCTTG ACCTGGGTAG 
CTTCTATAAG CATGCTCTAT AGATGACACA 
TTAGTTTTGT CACTTGACCT GAGGAAATGG 
ACAAGTTACA AGAATTTGTG AATTAACACC 
TCAAACAAAA TTATTTTCCT CTAGGGATAG 
CTCCATAGTC CAACTCAGGC TACAAGTGAT 
GTGATAAAGC TCCTTTGATG GAATTATTAA 
AAACTGCCAG GTCCTCAAGT TTGAATTTGC 
CAAATCTGGG AGCTTGGGAC TTTAGGTAAA 
TATTTTGTTA TACT AAATGA TTTCCTAAGA 
CCTTCAATCT AAGAATGCAC CACCAAAAAA 
GATTATGCCT GCAACATTTC CTCTCTGGCA 
AGTAGGATTG TACTCCTTCC ATGAAGTGTG 
GGAATCACAT ATTGACCTGG TAGCATATGT 
ATATA AATAT Al I I I IGTAT AAACCTCCTT 
TTTTTCTTAC TGACTTGGTA AATTGAATTG 
GGAAAAGATT CAGGAGTAGG CCACCATTTG 
GACTAATATT TGACTATTAA CCAAACATGT 
I I I I IGCCGG TTGTGAAATA ATGAATGACG 
ATTTTCACTA LI I I I IGTCT GATCTTGAGG 
ATTCCATAGA TATTTTCTTG TTATTTGTGC 
G I I I I CAAGT AGGGCTGCAT CTTCGTCTTA 
CCATCTAAAA ATTATACTAA TTTATCCCTG 
TAATGAGTTT CATAAGACAA GCAAAAACTT 
GTTTTAGTGT GGTGATGGAG TCATAGTTGT 
CAATCATTCA TCCTAGAGTT TACACACTGG 
GTGGCTGCAC AGGTAGGGAC AGGGGAGGTG 
TGTGGGGCTT GGGTCTCTCA TCCCCTTCAA 
TCTGATTTGT AATTGCTTCT GTACAGCGGA 
AATGAGCTTG AGAGGCTCTT TATTTTGTAT 
CAGCTTCAGG ACCTCTTTGT TCATTTGAAT 
GCTCAGAGGC AAGACCAGAG GTGCCTGGAT 
CTCTGTTCTG TGTTCTCTCT ATAAAATGTT 
ATTTGACAAC ACCAAGCGGT TTCATTCTCT 
TTGAAGATGA ATTACTTCAT GCCTACAGCA 
CCTCTTCTCA TGGTTCGCCT GGCAGTCCTT 
GCCCATTGTC CTCTTCCCAG TAAGTACATA 
TACTTGACCC CATAAATTAG TACATGTGTT 
TTATAGGGTG AGTTTGCAAT TGCAATGCCT 
GCATTTTGAG TCACTTAAAA TTGGCCATTT 
GGTA1 



TGAG 
AGTTTCAGGT 
TTTAAACTCA 
GTAAAACTGA 
CATATTAACT 
TCAI I I I I I I 
TTTTAACCTG 
AATTATATAC 
CATATCTCTC 



TTTA 
AAGTTGAAAA 
GTAGGAGTTC 
TTCAAATCTT 
TCCAGTTTGT 
ATGCCTTGAA 
CAGAAAAAAA 
TCACACATGG 



TAGTGTAGGA 
GTGGAGCTGG 
ATATAACTTT 
ATTAAATTTC 
TGTGCTGCAA 
GGCTAATGAA 
I I I I ICAAAA 
CCTCTGCCAG 



FA 

I I I I I IATTA 
AAAAACAAAG 
TGCAAACGGT 
CTTGCCGCCC 
GTCATTCTCT 
GTATGATTTT 
TATTCAATTA 
GGCCTGATTC 
TATTTTATAA 
ATGATATTTC 
GAGAATGAAT 
CTGCCACACA 
CTCCTCTTTA 
ATTTCTGACA 
AAGAGGACAT 
AAGTGACTAT 
TCTTAACAGT 
GCCACATAAA 
TTACATGAAT 
TTAAAGI I I I 
CATGTATGAC 
CTTAGGI I I I 
GCTTTAGATT 
AGGTCAATAC 
TGAAAATCCA 
TTGGAGTCCT 
GAGTAGTACC 
CACGTTACTT 
GAAAGAGCCC 
TGAGCTTGAA 
GTTTGTAACC 
GTAGGCTGGG 
CAAGAGCACC 
GATAGATTTA 
TATACCTTCT 
GAAGGTTGCA 
TCCCAGGCCT 
GCCATAAGTG 
TTTTCCTGTT 
AAGTGTATAC 
GTGGCAGTAA 
AGACTTTGAT 
CTACCTTCAT 
GAGGATATTA 
AATGTGTAGA 
AAAAAATCAT 
AGCTTCTGTC 
GGAACCTTGA 
ATCACAGTTT 
TAATATACAA 
AAGGGGATTC 
AACAGTTATG 
ACTCACACCA 
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21601 
21651 
21701 
21751 
21801 
21851 
21901 
21951 
22001 
22051 
22101 
22151 
22201 
22251 
22301 
22351 
22401 
22451 
22501 
22551 
22601 
22651 
22701 
22751 
22801 
22851 
22901 
22951 
23001 
23051 
23101 
23151 
23201 
23251 
23301 
23351 
23401 
23451 
23501 
23551 
23601 
23651 
23701 
23751 
23801 
23851 
23901 
23951 
24001 
24051 
24101 
24151 
24201 
24251 



GGTCACCCCT 
CAGAGCAGAG 
CCTAAGGATA 
ACACTGAATT 
AACCAGCACT 
ACTATTTCTA 
AAATATATTT 
TACTACTATG 
TATATCTGAA 
AGCTCCGTCT 
GATAGGGTAA 
GAGCCCTTCT 
TGTAAGCATT 
GCTTCTGTAT 
TTAAGGGATT 
TCAACTAGTG 
GGGAAGTAAA 
CCAAATCTCC 
TATAAATCTG 
TTAGAACCAA 
GCTGTCCTTC 
TTGCACATTA 
TGTAAGTGCT 
TGTGCACAGT 
AAATGGTCTT 
TTTCTAGCAA 
TTCAGCTGGG 
GAAGGATACA 
CTCAATCAAG 
GATAAAAGTT 
GCTCAGATTT 
CAGTTTATTT 
TTGATATCTC 
AAAATCTATC 
GTAACTGGGC 
I I I I ICTCTA 
TTTCU I I I I 
CAAGTGAATA 
CATTTCTTTC 
CTCCTTCCAG 
GCAGTAAAGT 
ATGGATATAT 
AAAAAAGATC 
TTTCACTTAT 
CAACATAGAA 
TCGTAGGTTT 
GTGAGCTACG 
TAGAACTAAG 
CATTGGGCCC 
GGACTCTTTC 
CTTAATGGAA 
CCTGACCCAC 
CAGTAAAGAA 
GGGAATCAGC 



CCCTGGCATT 
CTGGTAGTGA 
AACACAGGTC 
TACTCATTCA 
GTCCTGGACA 
TAATGACACT 
TGGCACATTC 
GTGGCTAATA 
AGTATTTCAG 
TGGGTACAAG 
AATTGCCAGG 
TTTTATGTGT 
CACATCATTG 
TGCTTTATTC 
GCCAGAATTG 
CAGCTCATAG 
AGTCATTTAC 
CCTTAGAAAT 
ACCATCAGGA 
AATTGACCAT 
ATTATGTATT 
TCCTGTCCTC 
CCACTGCCCC 
GACTGGCACA 
CAAATGAATC 
AAGCATAGAC 
TTGAGGATCT 
CAGAGATGGT 
TCCTCTGAAG 
GTTATTGGCG 
TCAATGTGTC 
CATTTAAGGA 
CACACATTAA 
AGCTTTAGGC 
TCTCTTTCTT 
CTTCATTTAT 
AGTAAAAGGT 
AATGAAATGA 
TCTCTTAGTA 
AGGACTGATC 
TTCCATATTT 
AATGAACACA 
CCTAGCTACT 
GTTTTCTTCA 
ATGCAGCGTA 
TCCAACATCC 
TCTAAACAAT 
ACTCTCACTT 
CTGCATCAGA 
TCTGGAAATA 
TCATTGTGGG 
AGCATCTGAC 
AAAGGCACAT 
CAGTGGTCTG 



TGTCATTGGT 
AGATTTGGGC 
TTCACCCTCC 
TTTGAGGGGG 
TAATGTTCCA 
GTGGTCACTT 
CI I I I ICATG 
TTTTACTCTT 
ATGACATACA 
ACAGAAGACA 
CAAACAACTT 
AAAAAAATCA 
TAGAATCAAA 
ATATTCATAT 
GCCATCGTCA 
CCCAACACTG 
TGATGAACAT 
TCTTACCCTA 
ACAGCAATAG 
AAGAACCAGA 
TCCACTCAAC 
ACCCTTGCAA 
ACAGAGTCAT 
GAGGGTGAGC 
TTGCCTTGTC 
ACTCTTTCCC 
GGGCTAAATG 
GACAGAGAGT 
CTAAATTCAA 
CTTTTGCTTG 
AAATGCTGAC 
ACTCTTCACC 
TAATAAAACT 
ATTATTTTGT 
AGGAGCTTGA 
TATCTTTCTT 
GCATAGTAAC 
ATTAAGGTAG 
GTGGGTGGGG 
ATGTCATTTT 
CCATAAAGAA 
CAGATGAAAA 
TGTATTTCAT 
GATCTCCTGT 
TATTGCTATC 
TTGCCTTAAG 
GGGAGAGGCT 
CCTCCTGTGC 
ATACAGCACC 
ACAGATGTTG 
TCAGAGGCCC 
CCACATTTCC 
AGCTAAGTGG 
CCCTAGCAAA 



GTCAGTTTGT 
TGTGTGAGTT 
TGCCAGCTCC 
AAAAAAATAA 
TACAGGGCTG 
TAAATGCAGC 
AGTGCATGAA 
AAATCATGTC 
CATAGCTTTA 
ACTATAAACA 
CACTGAGAAA 
CTCACTAAAT 
GCA TAAGAA A 
AGTGTTTTCA 
CACAGACAGC 
AGGGCTGCAA 
TTCACCTCAG 
TGTGAGAAAT 
TGTGTAAACA 
GTTCAGAAAA 
ATTAGCATTT 
TGTTACATTT 
AAGTCCCTGG 
TCTGTCGTGC 
TTGAAATGTA 
TTGGTGACAT 
AACCAAACCT 
GGTCACTTCC 
I I I I I I I ICT 
TTTATTTCGT 
TCACAGCATG 
AGTAAGTTTA 
AACAAAACCT 
GTTCTCCTTC 
GAAGATATGA 
TTTTCCAATC 
TGCTTGTAGT 
TGI I I I CACT 
TATCAGTTAT 
CAGCTTATGC 
CAAGAAACCA 
TTTCACCTGC 
CTTATAATTA 
TTTGAAGTGT 
AACTGCAGTG 
CAAACCTGCA 
I I I I I I I I I I 
CTCCACATTT 
CCCTAACAGG 
TCTCTAGAGC 
TGGATGGTGC 
AGGTTCCTAG 
AAGAGCAGAT 
GGTAAACAGA 



TCTGAGATCC 
AAAACCACCA 
TGTTTCATAA 
GTGACACAGT 
GCATATGAAG 
TTGTGTGCTG 
ATCAGATCCG 
TTGCCTCTAA 
GCCTAAAATC 
GAAGGTATAC 
AGGATATCTG 
TTTGGCACAG 
TCTGTGATGT 
AGCCATGGTT 
TGGTAACAGT 
TTATTGTCAT 
CATGGAAAAT 
AAAGCACTGA 
TTAGATGCCA 
ATGACTAACT 
ATGAAACATT 
ATATAATCTG 
GACTTGGTGA 
TTGGGAAGAA 
TAAACTGCCT 
GTGCTACGAA 
CCCTATACAT 
GTGAGTGGAT 
TTACTAAAAT 
ATAACTTAGG 
GTTCTCCTGA 
TTTACTTGCC 
AATCTGAATT 
TTTCAACATG 
CTGGGGTTTG 
AGGTTAGTTT 
ATTTGTTGAA 
AGCAGCCCAA 
GGAATGGCAC 
TTCCCTTTAT 
AATAATCCTA 
CATGCCTTTG 
AAATCAGTCT 
ATATAGATAT 
GAGCAGTGAT 
AAATCAAAGT 
TTTTAAGAGT 
TTGACCTTCA 
CTCCTGTTCA 
TGCATAGAAC 
TGGGGACCTC 
CGACTTGTGT 
GAGGCTTGGT 
ACTGCTGGGG 
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24301 GCI I I IGGTC CTAGGCTCAC TACTCAGGGA GGCACTTTAA CATGGAATGA 
24351 CCAGCAAGTT TCCTTCCTGA TCTTTTCCAC CACCACCACA AGCCTAGTAC 
24401 CTCCCTCCCT CTTTGCTCTG TTGCTCTCTT CGGGAATGCA CTGGAAACCA 
24451 CCTTCAGTTC TGTTTGGAAT TTTCCTATTC CTTATTCAGA AAGAGGAAGA 
24501 AGCTTTTGCA TTTACTCCAA CCGTTCTACC TATTATTCCC ATAAACTTTC 
24551 TGTGATCTCA TATCATTAGG CCAAATGTTA ATCTTTCTGG GAGCCAGGAG 
24601 ACTGCTTTCA CATTCAGAGG CCCTGGACAT ATAGGACTGC CTCTAACTCA 
24651 CTCTAACTCA GCTTATTGAC TTGAATGCAC CI I I I IAACA AGTGACTAAA 
24701 AAACAAACTG TGACTATTCT CTGAAAATGA GCCTATATCT CATACTTATT 
24751 TATTCTGTTT AACACTGTGA AACAAATTAA GTCCTCTGGC ACTATGTATA 
24801 TACCATAAAA AGCTTATTTG TAAGCCTACT AATTGGACCA GTTTTGACAA 
24851 TATTGAATAA GCACTAATTG CAGATCATAA TGTAGAATTA TAGGCTGCTG 
24901 AGGAAAACAA TATCACACCA TTTGCTTTCC TCAGTTTCCT TTTCAGAATG 
24951 AGTTTCATAA TGTTCACTAA TCCAAI I I I I AAAATCCTTT ACAAAGTTAT 
25001 TCTTAAACTA TTTCCAGAGA CTATCTGGTT TGTCATTCTA GAAATGAAAT 
25051 TGCCTTTTCA GCCTAAACAG ATGGCCTTAA I I I I IGGTGG AGTGGTATGA 
25101 AAGGAATGTC ACATGAGAAA CTGCAAGCTA TTTAGCTTGA Al I I I I IGTC 
25151 ATTCATACAT GTTTCAAAAT ATATTTTACA TTTTCTCTCT TTTAAATGAG 
25201 TTCCCATCTC TGCACCTTAA GTGACTTCAG AACTAAAATT TTAAAGTGAA 
25251 CATCAATCAC AGCATTTCCA AAAATGTGAA CTCCTAGCTT AACCGAAGTA 
25301 TTCACTTATT GGAAAGCTGA TAGAGTAATT CCACTAAGTC CAAAAAGTGT 
25351 CCTCTAAAAG ATTCCAAAGA TAAGAGTGTT TTCAACTTTG TCAAGCTGTA 
25401 CAAACACAAA TGTCACTCCC TCCCTCTGCC CACAGGGATC TTTATCCAGT 
25451 TACAGCAGCG TAACTTGAGC AGCTGCTGCA AACTGAGGCT CTCTTGACCC 
25501 TTCGCCTACT TATTTCAGCT GCTAAAATAG GGCTGAAATC TGTCAAGGAT 
25551 CCTGAAGGGA AGGATAAGAT TCCTACTATT CAATTTAATT TAAGCTTTTA 
25601 TTCAGTGCCT GCTGTGTGCA CAACACTAAG CTAGAAAGTC TGAGGAATGT 
25651 TTAGATTATT AGGTCCTGTT CCTTGCCTTT CATAGATTTA CAATCTATTG 
25701 ATAGGGAGAG CTAAAAAGGA GAGAAAGAGG AAGGAGCAAA CATAAAAACG 
25751 TCAAAATTTT AAAATACCAT TTTAAAATTT TATTTTAAAA TGTTAAATAC 
25801 CATGCAAAAT TAAGGAAAAC CTAGATTCAT AAAAATTCCT TTCACAATCT 
25851 TGTGTAAATC AATTCAGTGC TTGCCCTTAA TGTCTCATCC AGTCTGATGA 
25901 GACATGTTTT GTGATCAACA AGGGTTTTAC TATGTTTCTT AATTATGTGT 
25951 CTTGCCTGTT ATCTCTTTCT GACCGAGATT ANN IAACA ATAAATTCTG 
26001 AAAACTAAGA AAGTGAAAGC ATAAAATATT GTCTTATAAA ATACGCCAAG 
26051 GAAAAAATGA CACTCCATTT CAAATATCAA AAGTTAGCAT CAAGACTGCA 
26101 CAAGATGAAT GTACAGTCAT GTGTTGCTTA CAAATGTGGA CATATTCTGA 
26151 GAAATGCATC TTTAGGCAAT TTTGTCATTG TGCAAACACC ATAGATTGTA 
26201 CTTGCAGCCT AATTGGTGGA GCCTACTATA CACTAAGGCT ATATGGCATA 
26251 GCCTAGTACT CCTAGGCTAC AAACCTGTAC AGCATGTTAC TGTACTGAAT 
26301 AGTGGAGGTA CCTGTAACAT AATGGTAAGT ATTTGTGTCT CCAAACGTAG 
26351 AAAAGCTACT GTAAAAATAC AGTATTACAA CCTTAGGGTA TCACTGTCTT 
26401 ATATGTGGTC TGTTGTTGAC CGAAATGACT ATGCTTAATA CCACTGAACT 
26451 GTACACTTAA AAATGGTTAA GATGGTAAAT TCTATGTTAT GTATGTTTTA 
26501 TAATAATAAA AAAATTGAAA AAAGCATCAA CATC I I I ICT GGGAAAAAAG 
26551 AAAAAGAAAG AAAATGCATT AGAGTGATGA GAATATTTGA AGTAATAGAT 
26601 AAAGTCAAAA ACAAAGAAAT GATCTTGCCT TTGAACTTTC TTGTTTAAGA 
26651 TTCGTACATC AGTGATCACA CTGTTATTTC CCAAACGACC CTTCAGCTGG 
26701 ATACGACATT TCCTGATTGC AGCTGTGCTT ATTGCACTTA ATAATGTTCT 
26751 GGTCATCCTT GTGCCAACTA TAAAATACAT CTTCGGATTC ATAGGTGAGT 
26801 TTCAGAAAGG CTTCAATTTG GTCAACCCAA ACTCACGCCT CATTAAATGA 
26851 TGGACAGGGA ACCAGTGCTG GGTCATCCAG ATCCCCGTTC TTTCTCAGGC 
26901 TCATGGATTC CCTTTATCCC TGCGAGGCTC TGGTGATTGA GCTGCTCACT 
26951 GTCTCTTCCT CCTAACTGAC ACTGGGAGCC ACCTTATAGG TCATTTAGTC 
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29701 TTTCCCCTGC CGTCTCCTCA TCCTGCACCC CCTAAAAAAG TACCAAACAT 
29751 TTGCATTCTC AGAACATCAA ACAAAAATGC CCTGGTGGCA AAGCTATCAC 
29801 CATTTAATGT CTTCTCTCAG TCTTGCACCA AAGTCTCTGG TCTGTTTACT 
29851 AACAGAGGCA AAAGGCATGT CTTAGGAACT GTTTCTGTTT CTGTAAGGTA 
29901 CATGAATGGT CAAACACCAG TCTAGAGCAT CTTATTGTCA ACAGCAAAAT 
29951 AATATTTTGC CCACCCTGTT TGTGACATTG AGTTGTGACT TCTATATTCA 
30001 ATAGAI I I I I GTAAATGTTA AAACATCTAT ATTTAAATGT TAAAACACTA 
30051 AATATAGAGA GGGGCTTTAT TTCAATCATA GAGCAACAAC AAAAATAATG 
30101 CTTATAGCTA AACTGCCTGT TCTAGAAAGC ATCTGCTTTT TCATGTTATT 
30151 CCTAAATCCT CTTGTCATAC TTTTGTCATT GAACAATGCT CTCCCTCTCG 
30201 TCTTCCATCC TCATTCAGAA I I I I IAGAAG ACCACAATCG TGGAGATACA 
30251 CTACCCAGTA TTGTTTGATA CAI I I I I ATT TGATAAACAT TCAGTGCAGG 
30301 AAACTGTGAT TTGCTATATG TTTATGTATA TAATCTTATT CTGTAGTCAT 
30351 CAGAATGTTA ATGTAAGGTA CATTTGATTT TTATTTTTTA CATGTGTAGT 
30401 TTTCTTTCTT CACAGTCAAA GCATTTATAT TATTGGGGGT GGGGGCAGGG 
30451 AATTAAGTTG GTGGGCTCGA AAATCCATTC ATATGTATCT GTCTACAAAT 
30501 GTCTGGGGAT AATTTAAATT TGAAACCTAA GTTATATATA GTTTGGCAAT 
30551 GCTCTTCTTC AATATTTACA ATAATAGGAT GATCTACAAG AAAATAAGTT 
30601 TCTTTTTGCA AAI I I I IATC ATACTAAAGT TGTTCTTTTA ATTTAGCATA 
30651 TCTAAAATAG GAATTAGTTC AGTTTAGCTC ACACAGGTGT TTGCTGACAT 
30701 TCATTGGCCA TTTAATACAG TGTTGAGTGG TTCTCCGTAA AAGTATAAGT 
30751 GCTAACACTA CGAAGAAATG CACACGATCA TTCTTGCTCA CTTCTATAAC 
30801 AAACTTACAT AAAATGGATT TAAAAATTCC TACTCACAGC CTAAAACTTC 
30851 TGGAGTTCAC TACCI I I I I I TCAAATCATA GTAAGATCAC TTGTGTATTT 
30901 TATATTTTAG TAAAGCCAAT TATGAAGTAC AAGTATCATA CACGTACTTT 
30951 TGAGCTACTA TTATTTGAAA AAAATCTGCC AAATAGCATC TTTAGGATAT 
31001 ATTTACATTT TCACTCATCT AAAAAGTATA CAAAAATAAA AAGTGGAAAA 
31051 AGGTATCTTC TGAATGTTCA AGAGCATCCT ATAGTGCCAA ATAATAAAGC 
31101 ACCAI I I I I I TCTTCATAAC CAGGATTAAA ATTCATATAT ACTGCAGGGC 
31151 AGACATACAT ATGATAGCTT GTGCTGATTA ATTTAACCCC ATTTGTAAAC 
31201 AGATGAAAAT TTTATTTTCT TATTTCATTT ATAAGATGGC TCAATGTATT 
31251 GGGAGGCTTC I I I I I IATTA CAGAAAGTGT ATATTGGTAT ATAATAAATG 
31301 AACTTTTCAA ATGACTATGA TGTGAI I I I I GATCTATTGT TAAAGAATGT 
31351 TGTGTTATTT GTCCATGAAA CAAAATTTAA AATCCAAATA CTGTCTTTCT 
31401 TATATTGGTT TATGTTCCAT TTTCATTGTT ACCTTTGACA CATAACTAAC 
31451 ATCTATAGCC ATCATCCTGA AAATAATTGC CATCTTATTT TGGCAAAATA 
31501 GATATTTAAT CCTAAATTAT TATGATGATT ATAATTTTGG CATCACATAT 
31551 ATACCACCTA GAATGAATGT GGAAGAAATG AGTLI I I I AT GGTTAGTTTG 
31601 AAAGAATCCA TTGAAGATAG AAAATGAGAG AATAGAAGAA ACCTGAGAAT 
31651 AGTAAAATAA AGAGCAGAGA AAATATGGGG GCAGGGAAAA CATGTGAGTG 
31701 CTAAGGATTG ATTATGAATG AACGATTAGG GGGATTGATG GATCACAGGG 
31751 TAAGTATATG CTTAACTTTA TAAGAAACTT CCACATAGTT TTCCACAGTG 
31801 TTTCTACCAT TTTCATTTCC ACCCGTACTA CCTACAACTT CCACTGACTC 
31851 CACAGCCCTG CCAACATTTG GTGTTGTCTT TTGCATTTTA GCCTTTCTAG 
31901 TGGGTCTGAA ATGGTAACTC ATTGTGATTT TCATTTCTGC TTCTGTGACA 
31951 ACTAATGTTG AAAALI I I IC AAGTGTTTAA TGGTCACTCA TATATCTTCT 
32001 TTTGTGAAGT GTGTATTCAA ATCTTTTGCC CAI I I I I AAA ATTTAGGTTA 
32051 TGTGTTTTTA TTGGGTATTT GTAGAAGCTC TTTAAATATG GATCCATGTC 
32101 CAGATTGCCA ATATATTTTC CCAGTCTATG GTATGGTTGC TTATTTTCCT 
32151 AAAGGTGTCT TAATTACATC TTTCTGGGGC CAGGTCACCA TAGCTCAAAG 
32201 TTTTGCAATT TATGTCTTAA TGAGATAATA TTAATCAGAG TGGTATAGTC 
32251 AAAATTAAAT GTTTTGATGT CCTGGGCCCA TATAGGTAGG ACTGGATCAT 
32301 CTAACCAAGA TGCAAAAAAA AAAAAACAAA AAAACAAAAA TAGTACTTGG 
32351 AAAAACTTAT TTTAAATTAA ACA (SEQ ID NO: 3) 
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ACCCATATGCATGTCTT ACTTCTATTCTCTCTTAGCTTTTAACCTGC I I CI I I ICATCTT 
TTATGTATATACATTTAGGCTGCCTTATATTMTMTAGrrTCAl I I I lb I ICCTCCTGC 
TTAAAACACTGTGTGCTA I I I I I I I AAATTCTGAGAACTGC I I I CI I I ATTTCTAGACAA 
TTCTCTGCCATTATCTCTTTCTC I I I IGTCrCACCCTAGTCTCACAATTCTCTATATTGG 
MTGACrATCAGTGTATATTTGAACTTGTAATTCTTAI I I I I I CCCCATTCCTCTTAACT 
[T,C] 

CTTATTTGTA I I I I ICI I I I I I I MTCTCTTCATGCTATMTTTGAGTGATTTCCACAGA 
TCTGTCTTTCMTTTTATAAGTCTTCCTTCAGCTGAG I I I I I I I AAATTTCAATGATTCT 
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Al I I I I I IU I I I I I I I AAGAATTCL I I I I I I IGACTLI I I I I GCAACAGCCTGTTCTCC 
TTTTATATTCCTTTATAATG I I I I I ATTCTGTGAMGTTATTCTCTTATTTTGAATGrrT 
TCTTTCAAAATGTC I I I LI I I I I ATTMTTTMTGTAAMGTCCCTTTTAMTTGCTTTG 

SEQ ID NO: 6 

2594 CTGAALI I ILI I I IGI lACrATTQTMCrrrGGCTTCAGGATCCAACTGCCTAGAAAGT 
TACTTCCTAAACTTGATCCTG\C(TATGTTGCATATTATCMGC^ 
TCTTTCATGT CCAATTAAATTAAAGCAGTAA I I I I LI I I CTAGTTATTGCTAGTAGAGAC 
ACTGGTAGATTCTGCCITGGTAGACClTCCTCTGrCMCMTTT^ 
LI I I I AAAACATGTATCCCACTCACAAATACCrAAATTTCCTTGAAGACrGCr 
[T,C] 

TTAAGAI I ILI I I I I I I I I CCATAGTGACTAGTAAAACCTGCCATTTTCATTATACATAG 
GCACTCTATAMTATCTGCTMTTTAGCMTTATTAGTMTTTCLI I I LI I CTCTTCCAT 
TTCTTCLI I ILI I GTATTGGGTAMGGMCATTTCAGGATTTGCTTATGTAAAG I I I I CA 
GGAGI I ILI I I CCrrCCTCCCTTTTACAGAGAGX^TACAAMTGTAGATGATTCATAT^^ 
ACTTATTTCATTTAAATAAMTTATM 

SEQ ID NO: 7 

2757 TTATTGCTAGTAGAGACACTGGTAGATTC^^ 

TALI I I I GTCTTCL I I ILI I I I AAAACATGTATCCCACTCACA^TACCTAAATTTCCTT 
GAAGALTGLTGCCATG I I I IAAGAI I ILI I I I I I I I I CCATAGTGACTAGTAAAACCTGC 
CATTTTCATTATACATAGXK^CrCTATAMTATCT 

TCLI I ILI ILTCTTCCAI I ILI ICLI I ILI I GTATTGGGTAMGXW\(^TTTCAGGATTT 
[G,T] 

CrrATGTAMGTTTTCAGGAG I I ILI I I CCrTCCTCCCTTTTA(^GAGAGCATACAAAAT 
GTAGVVTGVVTTCATATTCACTTATTTCATT^ 

CrGTTTGCAGV\ACAG7^GTGTTCTGMCATCMCACAAACT ^ 
GTACAGTATATTATTTACACTGV\AGX3GGCTTGTGTCT Q _ ^ 

AAATGG7\TCC(^TGGAACTGACAAATC LU 

SEQ ID NO: 8 > 
6107 GTITCGTGTGCTGTTTCTATCTACATCTCATACTG I I I I CTATTLTCAAAAAGTAACCLT mj _ 
GTCATCCrCTTTCCTCTC^GATTATTTTCAGGATTAGCT^ f") ^ :E 

GTACAGATCTCCTACMTMTTATTTTCTATTTTA^ Ml ^ 

TTGAGACAGACAGV\GTTTL7VCTCTTGTGGCCCATGCT Jfl -> ^ 

CTCACTGCMCCTCTGXICTCCCAGXjr^^ LL - cs> 

[C,T] 

TGGGTVTTACAGX^GCCTGCCACCACACTCGGCTAAL I I I I I GTATTTCTAGTAGAGACGA 
AGTTTC^CCATGTTGGCCAGXXTGXJrCTTGAACTCCTGACCT 
CAGXCTCCCAAACTGCTGGX^TTACA^ 

ATTMTAGMCAATCTTCMTTATTTTATL I I ILI I I ATL I I ILI I I I CATGTAGGAAAT 
GTCCTAAMTTTTCAMCCCTCMTTTGAAAGCAL I I I I AAAATCATACATAGTCGAGCA 

SEQ ID NO: 9 

6392 CAGTCTCCTGAGTAGCTGGGV^TTACAG^ 1 1 I I IGTA 

TTTCTAGTAGAGV^CGAAGTTTCACCATC 
AGTTATCCACCCACCTCAGCCTCCCA^ 

TGX^CTCTAGGATTATATTMTAG7^(^TCrrCAATTATTTTATL I I ILI I IATCTTTC 
TTTTCATGTAGGVWUXJTCCTAAM I I I IAAAAT 

[T,C] 

ATACATAGTCGV\GXATTTTATATAAAAACAACTAAAMGTCTCT 
AAMTGCMTGX5G^GCAGCAGG^CTTATTMTTG^G^CT 
AGGTCCGTAC£CTCAAAGCCCCTGXTT^ 
MCCMGTTGTACATCTTCTAGXICTGT^ 

ATAGTGACATCTTAAAG I I I ILI I I CCCTCCCAACTL I I I I GCCAGTTCATTGAATTGCT 

SEQ ID NO: 10 

9484 GXA^CATTTATATCACAAATAT 

TCACACAGCTATTTG7\GXKiCTMGC^^ 
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TTGATTGMCTCriTTATGTATAATATTCTTCAGCrGAATGGGI I I I I ATATCMCTTTA 
CI I I I ATATAAGCCATG I I I IGAMTAMCTAGGATTTTAATAATCTGAATTTTAATAGC ^£ MAP * r< ' 
TATGTATGTAGTCATATATTTGTATGCTIT^ 
[C,G] 

CTGCCTTTCCTTATTMTTATACATACCATTAAAATG 
ATGMTAGAMTAGGAAAMCTTC^^ 

AAGAATGTTCCAL I I I I I AAMTGAQGGCCTCATTTTAGGCTTATAMCACTTAGCAGAT 
GMTTrGGrCAGMCMTTAAATCACTAAACATCATGGGGTGTGI I I I GTGTGTCTAAGT 
AGCCCAGACTGGATTAAGCITTCTCTCTT^ ATTTTAAAG 

SEQ ID NO: 11 

10280 ATMGACTGCMCATAGCTACAGGGCTITA ACA 
TTTGTAGTATTGATTTCATTGGGMTTACCMGGGATTAG^ 

Al I I I I I AAAAATAAACAAAGATAAAGA I I I I I I I I CTGAATTCCAGGTAAAAGGCAGCA 
TTGCTCCTCCATTTATTACGTAGATGCTTCTATCMG^TTCTTA I I I I I GTGCTCCAAAT 
CTTGGATTTGGAAAAATACCMTCCGTATAMCATAMGAMCCATACATGCATGT 
[A,G] 

TCCTMG^CCAGAMTGACTCTGMTGCAAAAAAAAAAAAAAAAAAAAMGGG 

GTGCCCCATCCTTAGCTTTCTCTGCTTTCTCTATTATATATGC^ 

TCTTAOWVGTACTTCGTMTCTMTGCACAGGATCAGCAGTMTGC^ 

TGCTTTCGCCTTTGGATTCCTAGATTT^ 

CTTCMTTCTMGTGCTGATGTGMTATCATGCAMTATGAT 

SEQ ID NO: 12 

10297 CT ACAGGGGTTATAAMTTTATMTTCATGGTCCAMTGTACATTTCT 

ATTGGGMTTACCMGGGATTAGATCAATTGTGGGGAAAGTGTA I I I I I IAAAAATAAAC 0 
AAAGATAAAGAI I I I I I I ICTGMTTCCAGGTAAAAGGCAGCATTGCTCCTCCATTTATT ^ ^ 
ACGTAGATGCTTCTATCAACATTCTTA I I I I I GTGCTCCAAATCTTGGATTTGGAAAAAT (—J ^ g 
ACCAATCCGTATAAACATAAAGAAACCATACATGCATGTGGGGATCCTAACACCAGA UJ S 
[G,A] > 
ACTCTGWGCAAAAAAAAAAAAAAAAAAAAM LU 
TTCTCTGCTTTCTCTATTATATATGCAACTGCCTGCCCCT Q 
TMTCTMTGCACAGGATCAGCAGTMTGCAGCTG^GACTGCATGC^ LU 
TCCTAGATTTCAGATTAAGGTTTAGTOVGGCTATTG^ £T 
GATGTGMTATCATGCAMTATGATGTACATATTCCCATGTGCTGAGTAAGTAGATGTAG O 

SEQ ID NO: 13 

10331 AMTGTACATTTGTAGTATTGATTTCATTGa 

GGAAAGTGTAI I I I I I AAAAATAAACAAAGATAAAGA I I I I I I I I CTGAATTCCAGGTAA 

AAGGCAGCATTGCTCCTCCATTTATTACGTAGATGCTTCTATCM I I I I IGT 

GCTCCAMTCTTGGATTTGGAAAMTACCAATCCGTATAMCATAA 

CATGTGGGGATCCTMC^CCAGAAATGACTCTGAATGCAAAAAAAAAA 

[G,A] 

GGMTTTTCGTGCCCCATCCTTAGCrrTCTCTGCTITCT 
GCCCCrCTATCTTACAMGTACTC 
CAGACTGCATGCTTTCGCCITTGGATTCCTAGATTTCAGAT^ 
TGAATAGCCCTTCAATTCTAAGTGCTGATGTGAATAT^TGCAAATATGATGTA 
CCCATGTGCTGAGTMGTAGATGTAGCATTTGCTAATGTTGCTATACATTTAGC^ 

SEQ ID NO: 14 

10536 TACCMTCCGTATAMCATAMGAMCCATACATGCATGTGGGGATCCTMCACCAG 
TGACTCTGAATGCAAAAAAAAAAAAAAAAAAAAMGGGMTTTTCGTGC 
CTTTCTCTGCrrTCTCTATTATATATGCMCTGCCTGCCCCT 
CGT AATCTMTGCACAGGATCAGCAGTMTGCAGCTCAGACTGCA^ 
ATTCCTAGATTTCAGATTMGGTTTAGTCAGGCTAT^ 
[T,C] 

TGATGTGMTATCATGGAMTATGATGTACATATTCC^TGTGCTGAGTAAGTAGATGTA 
GCATTTGCTMTGTTGCTATACATTTAGCATCTMGTTATGMCCAG^ 
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CTMCATTAAAAAAAACTITAGGGACTTCA^ 
CGACTTTAMGGCTATCTCTAGACTTCT 

ATALI I I I GGAMGCTGATGATTGAAMGATTMTGTGATCCTTTATTGrAACATCrAAC 

SEQ ID NO: 15 

ACCATTGATTL I IGI I IGGAGAACAT 

ACATG 




TTGATATATTGCTTATTGGT 
TCI I I I GGGCTTATAATTTCrATATGATGTTTAT 



rTTGAGGTTGCA 
GAGACTCCAGCATGGA 



ATTATATGACAAAMTATTTTAGTCATTAAMCMTCTCITTMCAAGG 
TTGATTGTAGGGTCTTTGATTTATGAAAMTTAGGAGAA 

AAAATTGGAGLI I I IGI I I CCATTACAATGCAGAACATTGGAGGT AAGGGGATATACTTT 
[T,C] 

CMTGGATCCCATAMCTTTCrATAGCGTGTTCMTAMTMGAAMCTTAT 
ACAGGC^CrrTAGATACAGAAAAATTGCTACTTATAG I ILI I AAATTTTAAAATGATAGT 
TTCTTAMTAGGrrTGTGTCCTGCTTTMTTAAA^ 

CATATAAMCCCTGCCMTTGMTTCTAGMTTAAAATATAAAATAAAAGC I I I LI I GAT 
TTTTMTGTTATTATAGCATGMTTAT^^ 

SEQ ID NO: 16 

TTTAGATACAGAAAMTTGCTACTTATAG I ILI I AMTTTTAAAATGATAG I I I LI I AAA 
TAGGTTTGTGTCCTGCrTTAATTAAAAACAG 

ACCCrGCCMTTGMTTCTAGMTTAAAATATAAAATAAAAGL I I ILI IGAI I I I IAATG 
TTATTATAGCATGMTTATTACTCTTAAAMTTGM 

GACAAAACAGTTGACG I I I I CT ATGTGTGACTGAGTTCGATTTACTAMCTGAAAAGTGG 
[G,T] 

TGTCTGGGGGMCATAGCCAMTGXTGTGGTCCTTGAMCGCAG^ 
CCACTAGACAGTGTCTCTGGMGTTTACTMGG 

CTATAAACCCCGG I I IGI IGATTCTATGGATTCrTATMTTCCCACTGMTTATCATTTC 
CAGTGTAGGACCTAGAAATATATATATATA I I I I I AACAATGTTCTCTCGTTGGTGTGTT _ 
TGCCCACCAGCrTCATACTGTTTCT^ 

SEQ ID NO: 17 LU 

GACTATTGCAGTAGTCTTa7\ACTGCT 

AACTCTAAI I IGI I CTCCAGATAMCTTTCTCAMTTTG7\GTCTGTTTCTAL I I I IGTCQi] 
TGCATAAMTTCTTCAGCATGCCTTTATTATIT^ 

TGACACAAGATCTTCGTCTAG I ILI I CTGCTCMTCnTCTAAACTTT C(TAGCAATGCC|T| 
CATATCTATCXATCTTTATCTATC^^ 

[T,-] ^ L 
ATCATCTATCAATTTATCCATCATCTA^ 

TTATATTTATTCCCCTAACAGTACTATTTTAATA I I I I I AAAAATCATCCATGCL I ILI I 
TTCACAGGCTACTTTCTCCCCTTGACTGTCTCTCA 
CAC^CACACACACA<^O^O^CA<^CACAC^CA<^CACATT^ 
CTGGTCTATTGXTCCTCTAG7\CTGGTAMTACTAGTTCCT 

SEQ ID NO". 18 

ATTGO\GTAGTCrrCTAACTGGTCTTCCTGGCTTGA 

CTAAI I IGI I CTCCAG^TAMCTTTCTCAMTTTGAGTCTGTTTCTAL I I I IGTCGTGCA 
TAAMTTCTTCAGCATGCCTTTATTATTTTCMG 

ACAAGATCTTCGTCTAG I ILI I CT«rT<^TaTTCTAAACTTTCCTAGO\ATGCCCATA 
TCTATCTATCTITATCTATCTATCTATCTATCTATCTATCT^ 
[A,-] 

TCTATCAATTTATCCATCATCTATACCCTACATGTCCTCT 

ATTTATTCCCCTAACAGTACTATTTTAATA I I I I I AAAAATCATCCATGCL I ILI I I ICA 
CAGGCTACTTTCTCCCCITGACTGTCT 
CACACACACACACACACACACACA(^CACACACATTT^ 
TCTATTGCTCCTCTAGACTGGTAMTACTAGTTCCTCTGGGCTCT 

SEQ ID NO: 19 
GCAGTAGTCTTCTMCTGGTOTCCTGGCrrGAGTTTCCCCTGCTCT 
Al I IGI ICrCCAGATAMCTTTCTCAMTTTGAGTCTGTTTCTALI I I I GTCGTGCATAA 
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MTTCTTCAGCATGCCTITATTATTTTCMGGAAAMCrrAMCT 
AGATCTTCGTCTAG I I C I I CTGCTCMTCTITCTAMCTTTCCTAGCAATGCCCATATCT 
ATCrATCrrTATCTATCrATCrATCrATCTATCTATCTATCTATCrATCTATCTATCATC 
[T,-] 

ATCMTTTATCCATCATCTATACCCTAC^TGrCCrGTGrCAMCCATMOV^ 
TATTCCCCTMCAGTACTATTTTAATA I I I I I AAAAATCATCCATGCL I I C_ I I I ICACAG 

GCTACTTTCTCCCCTTGACTCTCTCTCAMCTCCT 

ACA(^CACACACACACACACA(^CA<^CA<^TTTTCTCTCr(^CT 

ATTGCrCCTCTAGACrGGTAAATACrAGTTCCTCrGGGCTCrCATGGTCCT 

SEQ ID NO: 20 

13019 CTGACACAAGATCTTCGTCTAG I ILI I CTGCTCMTCTTTCTAMCTTTCCTAGCAATGC 
CCATATCTATCTATCTTTATCTATCTATCTATCTATCTATCrATCTATCTATCTATCTAT 
CTATCATCTATCMTTTATCC^TCATCTATACCCTACATGTCCTGTGTCAAACCATAACA 
MTTATATTTATTCCCCTAACAGTACTATTTTAATA I I I I I AAAAATCATCCATGCCTTC 
TTTTCACAGGCTACTTTCTCCCCTTGACTCT 

[c,-] 

GCACACACACACACACACACAG\CACACACACACACAG^ 

CCTGGTCTATTGCTCCT CTAGACTGGTAMTACTAGTTCCTCTGGGCTCTCATGGTCCTG 
TTTGTATCTAGT ATGTTACTG I I I I CTAMGGATATTTTAAAACACTTGAGTAGAGAATA 
AGCTTTTCX^GTCTGATGGACCrGMTTTGAGTCTGTTTCTOT 
GGAAGATCACTGTACTCCTTTGTCTGA I I I I I I CATGTATAAAAATTACCTTACAAAGGC 

SEQ ID NO: 21 

13022 ACACAAGATCTTCGTCTAG I ICI ICTGCTCMTCTTTCTAAACTTTCCTAGCAATGCCCA 
TATCTATCTATCTTTATCTATCTATCTATCTATCTATCTATCTATCTATCTATCT 
TCATCTATCMTTTATCCATCATCTATACCCTACATCT 

TATATTTATTCCCCTAACAGTACTATTTTAATA I I I I I AAAAATCATCCATGCL I ICI I I 

TCACAGGCTACTTTCTCCCC1TGACTGTCTCTCAM 

[A,G] 

CACACACAO^O^O^CACAC^C^CACACACACAG^CATTTTCT 
GCTCTATTGCTCCTCTAGACrCOT 

GTATCTAGTATGTTACTG I I I I CTAMGGATATTTTAAAACACTTGAGTAGAGAATAAGC 
TTTTGGAGTCTGATGGACCTGAATTTGAGTCTGTTTCTGT CACTATCTGTGAACTTGGGA 
AGATCACTGTACTCCTTTGTCTGA I I I I 1 1 CATGTATAAAAATTACCTT ACAAAGGCT AT 

SEQ ID NO: 22 

13285 AOXnOOCAMGTCCTCCAACCCTM<^^ 

CACA(^CACACA(^TTITCTCTCTCACTCr(KTCACCT 

GTAMTACTAGTTCCrCrGGGCTCTCATGGTCCTGTTTCT 

CTAAAGGATATTTTAAMCACTTCV\GTAGAG 

ATTTGAGTCTGTTTCTGTCACTATCTGTGAACTTGGGMGA 

[ G,A] 

Al I I I I I CATGTATAAAMTTACaTACAMGGCrATTGTGAGGATGAAATAAGGTAACA 
TATGGCACATMTAAGTGTTCTGTATATGCTTCTCTCCT 

TCCATGTCTCTGGAGTTGCCTGAATTA I I I I I I AAATAGGCATTTAAAAAATTATAAAAC 
AAATATATGATGATTGTGAAAMCTAAMCACTGCATAAATATATAMT^ 
GTTTATGTCAGT^TCCTCAGAMTMCTACTCATAGGTTTTCCCCTATGCCT 

SEQ ID NO: 23 

14461 TATCGAGCATTTCATAGGATTGCCrTATAGTTGGTCT^ 

GCATAAGCATMTTAACCCTGGACTCAAGMG1TGAGTGGCAGCACCTCAG 

AMGCATAGCCACTACTACGCrrCTAMCAATGGMTAMGTATAMGCGGT 

CMGCCTCACACAGCTMGAGGCCTGACTTTMGG 

ACCCCAGAMTMTGCTCTCACTTTGGTTACTTTATTTG^ 

[G,C] 

AGAMTCACTTGTATTTCTCTATTTMCMCTCTACAT^ 

TCCCCTAAAAAATTAACATrTACTGCAGATG I I I I CACATTAACAGATTAATGTCTGGAT 



0/> 




Q 
LU 

> 

LU 

O 
LU 



0^1 

CD 
CD 
C>vJ 



CD 
CVl 



CO 



ZXZ 



FIGURE 3R 



15464 



15469 



15545 



16199 



16798 



Title: 



Docket No.: CL001010 
Serial No.: 09/776,705 
Inventor: Karl GUEGLER et al. 
ISOLATED HUMAN TRANSPORTER PROTEINS.. 



CATTCTGAAI I I I I GMGACCAMCATGTTAACATCACTGACATCACTGAAAACG^GCAA 
TTMTAGCTGTM(^TTGAATGGTACCTCACCMCK:CAGCTMTG^G^MTATCTCCrGT 
GTrTCACACTCTGTMGATTTAGCrrTAGCCMGGTCTT^ 

SEQ ID NO: 24 

TGAGTTCTA I I I I IAACTGAATU I I IGGCCATGTC7rCAACAAATTAACGTTATCCTTG\ 
CCAMTQGGTGGGCrTGAAAMGGCGTGATGCATAMTATTTACAGTTGTAGGCAAM 
GTAATGTTATGTATATGAATACATATTCA I I I I I I CAGGGAGAAGGCTTGTAGATTTCAT 
CMGAMTCTITCACMGAGTAGATMTCATTCATGTATCACTTACCrAGATGCT 




AATTTTGCCACTTTATATMTTCCTTAGT^ 
[-,G] 

AAAAAAAAAAU I C_ l I I GACAMGATGGAGAGMGCTGTCATCTCrTGTATTC. I I I IATC 
AATCCAGGAAGCCTTTGG I I I I GACAATMGrGGTCTGAGACTTTGrGTACTCCTCAGAT 
AGGTCCCGGAGGACTAGATTGGTGCCCATCTGCAGAAMCCAGAGGGGATATATTGACTC 
TGCAGATCTGCCCrrTGATTCTGCCATCrCrCAGCTGGCCCATGCLI I I lb I IGCCAGAC 
TACTGCCCMGTTATAGA(^CTM(^CAGGCACACTGAGrATGGGCTATGrrGATTTATA 

SEQ ID NO: 25 

TCTAI I I I IAACTGAATU I I I GGCCATCTGrCA^CAMTTMCGTTATCCTTCACCAAA 

TGGGTGGGCTTGAAAAAGGCCTGATGCATAMTATTTACAGTTCTAGGCAA^ 

GTTATGTATATGAATACATATTCA I I I I I I CAGGGAGAAGGCTTGTAGATTTCATCAAGA 

MTCTTTCACMGACTAGATMTCATTCATGTATCACTrACCT 

TGCCACTTTATATMTTCCTTAGTTAGCCAAAAGGAGACTAAG^ 

[-,A] 

AAAAALI I CI I I GACAAAGATGGAGAGAAGCTGTCATCTCTTCTATTL I I I IATCAATCC 
AGGAAGCCTTTGG I I I IGACAATAACTGCTCTGAGACTTTGTGTACTCCTCAGATAGGTC 
CCGGAGGACTAGATTGGTGCCCATCTGCAGAAMCCAGAGGGGATATATTGA 
ATCTGCCCTTTGATTCTGCCATCTCTCAGCTGGCCCATGCC I I I IGI I GCCAGACTACTG 
CCCAAGTTATAGACACTMCACAGGCACACTG^ 

SEQ ID NO: 26 

AGGCCTGATGCATAMTATTTACAGTTGTAGGCAAMTTCT 

CATATTCAI I I I I I CAGGGAG^GGCTTCTAGATTTG\TCAAGAMTCTTTCACAAGAGT 
AGATMTCATTCATCTATCACTTACCT^ 

TCCTTAGTTAGCCAAAAGGAGAGTAAGATGAAGAGGGGGGAAAAAAAAAAL I I LI I IGAC 
AAAGATGGAGAGAAGCTGTCATCTCTTGTATTL I I I I ATCAATCCAGGAAGCCTTTGGTT 
[T,C] 

TGACMTAAGTGCTCT6AGACTTTGTGTACTCCTCAGATAGCT 
CTGCCCATCTGCAGAAMCCAGAGGGGATATATT^ 

TGCCATCTCTCAGCTGGCCCATGCLI I I IGI I GCCAGACTACTGCCCAAGTTATAGACAC 

TMCACAGGCACACTGACTATGGGCTATGTTGATTTATMCT 

GMCTGCAGCTTCACTCTAMCrrTGGAGCAGGATTTMCACAGM 

SEQ ID NO: 27 

AGAACTTGGAAGCAGTGCCAAATACACAATGAL I I I I I I I I CCATTTQGGGGATTAGATG 

TTG\TCTTACATATCCCAMTCTCATMCTTGCTTGCATCT 

CCATTMGCTCTCACATTTTCCATTTTAGCMTGTCA^ 

TGAACTACCTGMGTMTCAGAGCATTCATGGGACTTGMGAAMTACTa 

ATGCTCCCTCTCTGACAT^CTGACTCATTCTACTTGGTCTTTTCT 

[T,C] 

TGTCTCTCACTTCTAGAGMTGCTACCTCMTGG 
TTGGMTTATTCTTCCAC7TTCGCT 

AAATAI I I I IAI I I I I ATTTCA(^TTTAAATAGGTTAGCTAATTGTAGATGCCATATTCA 
CCTTCCAAAATGCTTCTTCT AACTTCTAGGrTATCTTGGCTATACCACTGGATTTTCTCT 
TACCTGCATGGTG I I I I I IGI I ACTGTGGTAACTGATCTGATGACATGATCCTTGCAGGT 

SEQ ID NO: 28 

GTTGGTTAGCATGACj I I I I I I I GTGCCTAAATTAGTGTCCTCA I I I IGI ICAAGCACTTC 
ACTAATATGAAATAG I l(_l I GTATCACAAGTGA I I I ICI I GTAGACTAATTTAGAGCAAA 
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AAMGAGCAGCTACGATTTAMGATAGTTGAGGTAGMTATCAAAGCrACrACTAATI 
TTGCTCTAGGCACACTGGTTTATATATGG^ 

<^TMTCTGGCATTTAGMCAGCAGAGGAGAGTCCCAGATGAGAM(^AGAAGGCrATA 
[T,C] 

CCATATTC^CATGMTCAGCCATTCTCTCTTACACATTCCACCCATTMGA^ 

MCAGTGGGATTAMGMGAMTCCTCCTCTCTAGGCCC^ 

TGCACTATCATGAATGCCAAMTTTATAMGCATTTCCCCAMGAGCTAM 

AAAAMGTTTTGMGACCCATGTCACCTTAGTTTGMGAMTMGGAMTGATCA 

CTCATGGAAGGGCATGAAAGAGGGTGGGMGGATTCTTGCAAAATATTCT 

SEQ ID NO: 29 

18103 CATTTTAGCATTCTAATTTOCT 

GAMCACAGTTTATAGCrrCCTCrrCAGAGAAMTATGTACTCCArCCACrCCT CAGTAA 
CATGCTTTMTCAGAMGGTGGGMTCAGCCCACCACAGCACTACCTTATL I I C I I I CTC 
TCCTTTCTCTCCACCATMTGGTTCAGGGGAG 

ATGGTTGTMTMTTTTGGCAGGTGTTGGGMTTTAMTTTGAA I I I IGI ICGGAAGAAA 
[C,T] 

GATGT(^GCTGGACTAGAMTGAAMCACCCATGACGACCAAMCTTATGGTTAGGGGCA c=> 
GCCrCGATMGC^GTGATGrCATTTATAGTCAGCACCTAACCCrrGrCTAGAACACATT Q ^ 
CATTACAAGAGATGTGTCAATATCTCTCC I I Id I I GTCTTATTTGTACAATAGAGrCACr Uj g § 
GGCTAGAAAATC I IGI I ICI I CCAGCTGATGGTCTATGGTT^TTTGTATTCTTTTCCCT ^> ^ g 
TTGAAGI I CI I GATATTTGCTTGGGMCAMGGATATGAACTCATTATAGCTGTTTTCCr rr: ^ Z~ 

SEQ ID NO: 30 }^ ^ ^ 

18421 AMTGAAMCACCCATGACGACCAAMCrTATGGnrTAGQGGCAGCCTCGA^ , ' 

ATGrCATTTATAGTCAG^CaAACCOT&TCTAGMCACATTCATTACMG^ J£f 
CAATATCTGTCC I I IGI I GTCTTATTTGTACMTAGAGTCACTGGCTAGAAAATC I IGI I CC 
TCTTCCAGCTGATGGTCTATGGrrCATTTGTATTCTTTTCCCriTGMG I IGI IGATATT ^Lj 

tgrrtgggaacaaaggatatgaactcattatagctgi i i i cctctttcctttaagggagg h ~ 
[a,g] 

tattatataataattctcaaci i c_ i i laatctagacatcagtaacctcagtcttcattct 
cactamtagcaamctttccccatamttctgatttacctcataaaa^ 
ctttgnagtattttgatgtctttgatttacrttg 
mgcctgacaattgatctttggcagccaggttccttctagaatgg i i i i cagaagl i i i i 
caggtagtctggacrccrggcagtagtacritgctgactctactagg i ici i i icctcat 

SEQ ID NO: 31 

18528 ACAAGAGATGTGTCAATATCTGTCC I I IGI I GTCTTATTTGTACAATAGAGTCACTGGCT 
AGAAAATC I IGI I ICI I CCAGCTGATGGTCTATGGTTCATTTGTATTL I I I I CCCTTTGA 
AG I IGI IG^TATTTGCrTGGXW\CAMGGATATGAACTCATTATAGCTGI I I ICCTCTTT 
CCTTTMGGGAGGATATTATATAATAATTCTCAAL I IGI I I AATCTAGACATCAGTAACC 
TCAGTCTTCATTCTCACTAMTAGCAAMCTT^^ 
[G,A] 

AMTTTCAGAACACTTTCAAGTATTTTGATGTCTTTGA^ 
GCAGTTACTCCAGAAGCCTGACMTTGATCTTTGGCAGCCAGGTTCCT^^ 
TTCAGMGCrmCAGGTAGTCTGGACTCCTGGCAGTAGTAaTTG^ 
TTCTTTTCCTCATTTAAAGTCATCTCATTATGAAATGCAA^ 

CTGTITCATCrrTATGTTMTTATATTCTT ATTCAGTGGGQ\AGCTTA(TGACCTACGTG 

SEQ ID NO: 32 

18722 TATTATATAATAATTCTCAACI ICI I I AATCTAGACATCAGTAACCTCAGTCTTCATTCT 
CACTAMTAGCAAMCTTTCCCCATAMTTCTGATTTACCTCA^ 
CTTTCMGTATTTTGATGTCTTTGATT^^ 

MGCCTGACAATTGATCTTTGGCAGCCAGGrTCCITCrAGAATGG I I I ICAGAAGCI I I I 
CAGCTAGTCTGX&CTCCTGGCACTACTACTTTGCT I ICI I I ICCTCAT 

[T,C] 

TAMGTCATCTCATTATGAAATGCAAMGCTTTCTATGTT^ 
TGTTAATTATATTCTTATTCAGTGGGCMGCTTACTGACCT^ 
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TCTTCTAGGGAAATGATTG I I I I I AAGACTGAAGGACTAGTGTTTAAGAAAAATGGA/% < 
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GMTCCTCATTAGCTCrCTMGACAMTTTAMTCAGCTATMGTTrATGTACT 
GTCTTCATGATTAGCAATATAGATATAC I I I I I I ATTATTATTTTCATTTTGAAAAGTGA 

SEQ ID NO: 33 

18775 TCATTCTCACTAMTAGCAAMCTTTCCCGVTAMTTCTGATTTACCT 
CAGMCACTTTCMGTATTTTGATGTCTTTGATTTACrrTGAA^ 
ACTCCAGMGCCTGACAATTGATCriTGGCAGCG^GGTTCCrrCTAGMTGG I I I ICAGA 
AGCTTTTCAGGTAGTCTGGACTCCTGGCA^ I ICI I I 

TCCTCATTTAAAGTCATCTCATTATG 
[C,G] 

ATCTTTATGTTAATTATATTCTTATTCAGTGGGCAAGCrrACTGACCTACGTGA^ 
CTGTTCCTCTTCTAGGGAAATGATTGI I I I I MGACTGAAGGACTAGTGTTTAAGAAAAA 
TGGAMTGMTCCTCATTAGCTCTCTAAGACAMTTTAMTCAGCTAT^^ 
TAAATATGTCTTCATGATTAGCAATATAGATATAL I I I I I I ATTATTATTTTCATTTTGA 
AAAGTGAI I I I I I I I I GTMGTTTAAAAAACAAAGCTTGGTG I ILI I ICI I I I ICCAGTC 

SEQ ID NO: 34 ^ 
18951 CAGMGCTTTT(^GGTAGTCTGGACTCCTGGCAGTAGTAaTTGCT LJ ^ g 

CI I I I CCTCATTTAAAGTCATCTCATTATGAAATGCAAAAGCrrTCTATGTTAGGAGCCT LjJ g § 
GTTTCATCTTTATGTT AATTATATTCTTATTCAGTGGGCAAGCTTACTGACCTACGTGAA > ^ ^ 
ATAGACTGTTCCTC1TCTAGGGAAATGATTG I I I I I AAGACTGAAGGACTAGTGTTT AAG (JJ 35 
AAAAATGGAAATGAATCCT CATTAGCTCTCTMGACAMTTTAMTCAGCTATMGTTTA "° 
CT.C] UJ 
GTACTAAATATGTCTTCATGATTAGCAATATAGATATAL I I I I I I ATTATTATTTTCATT 
TTGAAAAGTGAI I I I I I I I I GTMGTTTAAAAAACAAAGCTTGGTG I ICI I ICI I I I ICC 
AGTCGGTCCCGGAGAAAMTGCAMCGGTGTCAMTATT^^ 
ATGTACCTGCTTGCCGCCCTCTTT^^ 

TCATTCTCTATATGCAAATCC I IGI IAGGCTGGTCCTTGACCTGGGTAGGTATGAI I I I I 

SEQ ID NO: 35 

18974 ACTCCTGGCAGTAGTACTTTGCTGACTCTACTAGG I ICI I I I CCTCATTTAAAGT CATCT 
(ZATTATGAMTGCAAMGCTTTCTATGTTAGGAGC^ 

TTCTTATTCAGrGCXiCAAGCTT ACTGACCTACGTGAMTAGACTGTTCCTCTTCT AGGGA 
AATGATTG I I I I I MGACTGAAGGACTAGTGTTTAAGAAAMTGGAMTGAATCCTCATT 
AGCTCTCTAAGACAMTTTAMTO\GCTATAAGTTTATGTA(TA^ 

[T,G] 

AGCAATATAGATATAC I I I I I I ATTATTATTTTCATTTTGAAAAGTGA I I I I I I I I IGTA 
AGTTTAAAAAACAAAGCTTGGTG I ICI I ICI I I I I CCAGTCGGTCCCGGAGAAAAATGCA 
MCGGTGTCAAATATTTCCATCACGGGGATGCTTGTCATGTACCTG 
TGCnTACCTAACCTTCTATGCTAGCTCACTCT 

GTTAGGCTGGTCCTTGACCTGGGTAGGTATGAI I I I I AAAAATTGCCTTCTATAAGCATG 

SEQ ID NO: 36 

19540 GGTATGAI I I I I AAAAATTC^CTTCTATMGCATCCTCTATAGATGACACATATTCAATT 
AATATACTATTTTAG I I I IGTCAOTGACCTGA03AMTGGGGCCTGATT(^C^CTGXT 
AA<^GTTACAAGMTTTGTGMTTAACACCTATTTTATAAAAMTATCCCT 
ATTATTTTCCTCTAGGGATAGATGATATTTCTCTGGCTA 
CTACAAGTGATGAGMTGMTCCACTTGCATGTGATAAAG^ 
[A,C] 

CTC^QXG^CAAATAGG^^ 

CCAGTC7\AGTC7\MTCTGGGAGCTTGGGACTTTAGCTAAMTTTC^ 
TATTTTGTTATACTAMTGATTTCCTA^ 
MGMTGC7^CCACC7WWWV\GTGACTATGGCC7\CATrAGA 
CTCTCTGGCATCTTMCAGTTCACAMGGGAGTAGGATTGTA 

SEQ ID NO: 37 

19841 CTGCCACACAMTAGG\GGGAM(TGC^ 

CC^GTCAAGTOWXTCTGGGAGCTTGGGACTTTAGCT 
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TAI I I ILI I ATACTAAATGATTTCCTMGAMGAGGACATGACAGMTTTCCTTC^ATC^ 
AAGMTGOVCCACCAAAAAAMGTGACTATGGCCACATTAGATTATGCCrGCMC^TTT 
CTCTCTGGCATCTTAAG^GTTCACA^ 
[G,A] 

CCACATAMCAGATTTCATGGMTCACATATTGACCTGGTAGCATATGTTTACATGAATC 
AGTGTATCAATATAAATATA I I I I I GTATAAACCTCL I I I IAAALI I I I IAACTTAATTT 
I I I I LI I ACrGACTTGGTAMTTGAATTGCATGTATGACAMTTGTGGAGGAAMGATTC 
AGGAGTAGGCCACCATTTGCTTAGLI I I I I I I I CTATTTGACTAATATTTGACTATTAAC 
CAMCATGTGCTTTAGATTGGGCATTAAL I I I I I GCCGGTTGTGAAATAATGAATGACGA 

SEQ ID NO: 38 

20170 TATTGACCrGGTAGCATATGTTTACATGMTCAGTGTATCMTATAAATATAI I I I IGTA 
TAAACCTCL I I I IAAALI I I I I AACTTAAI I I I I I I C_ I I ACTGACTTGGTAAATTGAATT 
GCATGTATGACAMTTGTGGAGGAAMGATTCALX^ 
I I I I ICTATTTGACTAATATTTGACrATTMCCAMCATGrGCTTTAGAT^ 
LI I I I I GCCGGTTGTGAMTMTGAATGACGAGGTCMTACrACTGMGGrATTTTCACT 
[A,C] 

LI I I I I CTCrGATCTTGAGGTGAAAATCCAACTACGCTTGATTCCATAGATA I I I ILI IG 
TTATTTGTGCTTGGAGTCCTGAATGAAGGTCi I I I I CAAGTAGGGCTGCATCTTCGTCTTA 
GAGTAGTACCCACTGGGAGACCATCTAAAMTTATACTMTTTATCCCTGC^ 
ATACTTATTTTMTGAGTTTCATMGACMGC^ 

GTTTT AGTGTGGTGATGGAGTCATAL I ILI I GAGCTTGAAAAAATGGTAGCAATCATTCA 

SEQ ID NO: 39 

20343 TAGLI I I I I I I I CTATTTGACrMTATTTGACrATTMCCAMG^TGTGCTTTAGATTGG 
GCATTAAL I I I I I GCCGGTTGTGAAATAATGAATGACGAGGTCAATACTACTGAAGGTAT 
TTTCACTALI I I I IGTCTGATCTTGAGGTGAAMTCCMCTACGCrrGATTCCATAGATA 
I I I I CI IGI I ATTTGTGCTTGGAGTCCTGAATGAAGGTL I I I I CAAGTAGGGCTGCATCT 
TCGTOTAGAGTAGTACCCACTGGGAGACCATCTAAAMTTATACTMT^ 
[T,C] 

GTTACTT ATACTTATTTTMTGAGTTTCATMGACAAGCAAAAACTTGAMG^ 
MTATCTGTTTTAGTGTGGTGATGGAGTCATALI ILI I GAGCTTGAAAAAATGGTAGCAA 
TG^TTCATCCTAGAGTTTACACACTGGGTTTGTMCCTGCATCAG 
TAGGGACAGGGGAGGTGGTAGGCTGGGAGAGACAATATGTGGGGCTTGGGTCT 
CCrrCMCAAGAGCACCrTGGTCTCTGTCTGATTTGTMT^ 

SEQ ID NO: 40 

20519 GATA 1 1 I ILI ILI I ATTTGTGCTTGGAGTCCTGAATGAAGGTL 1 1 I I CAAGTAGGGCTGC 
ATCTTCGTCTTAGAGTAGTACCCACTGGGAGACCATCTAAAM 
TGCACGTTA(TTATACTTATTTTMTG^ 

CCAAAAATATCTL I I I I AGTGTGLTGATGGAGT CATAL I ILI I GAGCTTCAAAAAATGGT 
AGCMTCATTCATCCTAGAGTTTACAC^^ 
[G,A] 

CAGGTAGGGACAGGGGAGGTGGTAGGCTGGGAGAGACMTATCT 
ATCCCCTTCMCAAC^GCACCTTGGTCTCTGT CTGATTTGTAATTGCTTCTGTACAGCGG 
AGATAGATTTATCAO\ATGTAMTGAGCTTGAGAGGCT^ 

TGCAACGTTATCAGCTTCAGGACCTL I I ILI I CATTTGAATGAAGGTTGCATAGCTAATG 
AGCTCAGAGGCAAGACCAGAGGTGCCTGGATrCCCAGGCCTAGCT 

SEQ ID NO: 41 

20963 TGAGCrTGAGAGLXITCTTTATTTTCT 

CTGTTGTTCATTTGMTGAAGGTTGCA^ 

GCCTGGATTCCCAGGCCTAGGTL I I I ICCTCTGTTCTGTGTTCTCTCTATAAAATGTTGC 
CATMGTGACCTGTGCTGATTTGACMCACCMGCGGTTT I I I I ICCTGTTGT 

AGGAGMGTTGMGATGMTTACTTO\TCXCT 
[T,C] 

CrrCTCATGGTTCGCCTGGCAGTCClTGTGGCAGTMCACrM 
TTCCCAGTMGTACATMGACTTTGATGAMGAMCC^^ 
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ATGTGTTCTACCTTCATTTTGATTTMTTATAGGGTGAGT^ c «/ 
GATATTATTTTCCTATAGCATTTTGAGTCACTTAAM^ ^ 
AGCAAGTAGTTTCAGGTGGTA I I I I I ATAGTGTAGGAAAAAAATCATAAAACTTA I I I I I 

SEQ ID NO: 42 

21840 AMCAGTTATGCTATCTATCACATATCTCT 

AGGTCACCCCTCCCTGGCATTTGTCATTGGTGTCAG I I IGI I CTGAGATCCCAGAGCAGA 
GCTGGTAGTGAAGATTTGGGCTGTGTGAGTTAAMCCACCACCTMGGATA 
CTTCACCCTCCTGCCAGCrCCTGT^ 

GAAAAAAATAAGTGACACAGTAACCAGCACrGT CCTGGACATAATGTTCCATACAGGGCT 
[G,T] 

GCATATGMGACTATTTCTATMTGACACTGTGGTCACrTTA^ 
AAATATATTTTGGCACATTCC I I I I I CATGAGTGCATGAAATCAGATCCGTACTACTATG 
GTGGCTMTATTTTACTCrTAMTCATGTCrrGCCrCTMTATATCT 
ATG^CATACACATAGCTTTAGCCTAAMTCAGC^CCGTCTTGGGT ACAAGACAGAAGACA 
ACTATAMCAGMGGTATACGATAGGGTAAMTTGCCAGGCAMCMCrrC^CTG^ 

SEQ ID NO: 43 

22783 TGAGAMTAMGCACTGATATAMTCTGACCATCAGGAACAGCMTAGTGTGTAM 
AGATGCCATTAGAACCAAAATTGACCATAAGAACCAGAGTTCAGAAAAATG^ 
TGTCCTTCATTATGTATTTCCACTG\ACATTAGCATTTATG^ 
CTGTCCTCACCCTTGCAATGTTACATTTATATMTCTGTGT^ 
AGAGTCATMGTCCCTGGGACTTGGTGATGTGCACAGTGACTGGCA<^ 
[C,T] 

GTCGTGCTTGGGMGAAAMTGGTCTTCAAATGMTCTTGCCTTGTCTTGAMT^ 
ACTGCU I I I CTAGCAAMGCATAGACACTCTTTCCCTTGGTGACATGTGCTACGMTTC 
AGCTGGGTTGAGGATCT GGGCrAMTGAACCAAACCTCCCTATACATGAAGGATACACAG 
AGATGGTGACAGAGAGTGGT(^CTTCCGTGAGTGGATCTCMTCMGTCCTCTGMGCT 
AATTCAAI I I I I I I ICI I I ACTAAAATGATAAAAG I IGI I ATTGGCGL I I I IGCI IGI I I 

SEQ ID NO: 44 

22787 AMTAMGCACTGATATAMTCTGACCATCAGGMC^ 

GCCATTAGMCCAAMTTGACCATAAGMCCAGAGTTCAGAAAMTGAC^ 
CTTG\TTATGTATTTCCACTCAACATTAGG\TTTATGAM 
CCTCACCCITGCMTGTTACATTTATATMTC^^ 
TCATAAGTCCCTGGGACTTGGTGATGTGCACAGTG^CTGGCACAGAGGGTGA 
[G,A] 

TGCTTGGGAAGAAAAATGGTCIT(^W\TGAATCTTGCCTT GTCTTGAAATGTATAAACTG 
CCTTTTCTAGCAAAAGCATAGACACTCnTCCCTTGGTGACATGTGCTA 
GGGTTGAGGATCTGGGCTAMTGAACCAAACCTCCCTATACATGMGGATACACAGAGAT 
GGTGACAGAGAGTGGTCACTTCCGT GAGTGG^TCTCMTCAAGTCCTCrGAAGCTAAATT 
CAAI I I I I I I ICI I I ACTAAAATGATAAAAG I IGI I ATTGGCGCTTTTGC I IGI I IATTT 

SEQ ID NO: 45 

22825 CMTAGTGTGTAMCATTAGATGCCATTAGMCCAAMTTGACCATM 
AGAAAMTGACTMCTGCTGTCCTTCATTATGTATTTCCACTCMC^ 
MCATTTTGCACATTATCCTGTCCTCACCCTTGCM 
AGTGCTCCACTGCCCCACAGAGTCATAAGTCCCTGGGACTTGGTGATCT 
GGCA^GAGGGTGAGCrCTGTCGTGCTTGGGAAGAAAAATGGTCTTCAM 
[T,C] 

TTGTCTTGAMTGTATAMCTGCCTTTTCTAGCAAMGCATAGACACTC^ 
GACATGTGCTACGMTTCAGCTGGGTTGAGGATCTGGGCTAMTGAACCA^ 
TACATGMGGATACACAGAGATGGTGACAGAGAGTGGTCACTTCCGTGAGTGGA^ 
TCAAGTCCTCTGAAGCTAAATTCAA I I I I I I I ICI I I ACTAAAATGATAAAAG I IGI I AT 
TGGCGCI I I IGCI IGI I I ATTTCGTATAACrTAGGGCTCAGATTTTCMTGTGTCAAATG 

SEQ ID NO: 46 

22967 CCTCACCCTTGCMTGTTACATTTATATMTCTGTGTMGTGCTCCA 

T(^TMCTCCCTGGGACrTGGTGATGTGCACAGTGACTGGCACaGAGGGTGAGCT 



FIGURE 3W 



Title: 



Docket No.: CLO0101O 
Serial No.: 09/776,705 
Inventor: Karl GUEGLER et al. 
ISOLATED HUMAN TRANSPORTER PROTEINS. 



GTGCTTGGGAAGAAAMTGGTCrrOW\TGAATCTTGCCrrGr(XrGAMTGTATAAAi 
<XCTTTTCTAGCAAMGCATA<^ 

TGGGTTGAGGATCTGGGCTAMTGMCCAMCCTCCCrATACATGAAGGATACACAGAGA 
[A,T] 

GGTGACAGAGAGTGGT^CrrCCGTGAGTGGATCrCMTCMGTCCTCTGMGCTAMTT 
CAAI I I I I I I ILI I I ACTAAAATGATAAAAG I IGI I ATTGGCGCTTTTGC I IGI I IATTT 
CGTATMCTTAGGGCTCAGATTTTCMTGT^ 
TGACAGTTTATTTCATTTMGGMCTCT^^ 
CTCCACACATTMTMTAAMCTM(IAAMCCTA^ 

SEQ ID NO: 47 

2 3248 O^TGAAGGATACA<^GAGATGGTGA(^GAGAGrGGT(^(XrCCGrGAGrGGATCTCAATC 
AAGTCCTCTGAAGCTAAATTCAA I I I 1 1 I I ICI I I ACTAAAATGATAAAAG I IGI IATTG 
GCGCTTTTGCI IGI I I ATTTCGTATMOTAGGGCTCAGATTTTCMTGTGTCAAATGCT 
GACTCACAGCATGGTTCTCCTGACAGTTTA^ 

TTATTTACTTGCCrTGATATCTCCACACATTMTMTAAMCTMCAAMCCT 
[A,G] 

TTAAMTOAT^GCTTTAGGC^TTATrTTGTG^CTCCI ICI I I CAACATGGTAACTGG 
GCTCTCI I ICI I AGGAGCrrGAGAAGATATGACTGGGG I I IGI I I I I CTCTACTTCATTT 
ATTATC I I ICI I I I I I CCAATCAGGTTAG I I I I I ICCI I I I I AGTAAAAGGTGCATAGTA 
ACTGCTTGTAGTAI I IGI I GMCMGTGAATAAATGAAATGAATTAAGGTAGTG I I I ICA 
CTAGCAGCCCAACA I I ICI I I CTCTCTTAGTAGTGGXJTGGGGTATCAGTTATGGAATGGC 

SEQ ID NO: 48 

23764 GAMTGAATTAAGGTAGTGTTTTCACTAGCAGCCCAACA I I ICI I I CTCTCTTAGTAGTG 
GGTGT^GXTTATCAGTTATGGMTGGCACCTCCTTCCAGAGGACT 
CTTATGCTTCCOTrATGCAGTAAAGTTrCCATATTTCCATAA^ 
MTCCTAATGGATATATAATGAACACACAGA^ 
AMGATCCCTAGCTACTTGTATTTCATCTr^^ 

[G,T] 

TCTTCAGATCTCCTG I I I I GAAGTGTATATAG^TATCAACATAGAAATGCAGCGTATATT 
GCTATCAACTGCAGTGGAGCAGTGATTCGTAGGI I I I CCAACATCCTTGCCTTAAGCAAA 
CCTGCAAMTCAMGTGTGAGCTACGTCTAAACAATGGGAGAGGC I I I I I I I I I I I I I I I 
MGAGTTAGAACTMGACTCTCACTTCCTCCTGTGCCTCCACA I I I 1 1 GACCTTCACATT 
GGGCCCCTGCATCAGMTACAGCACCCCCTMCAGGCrCCTGTTCA^ 

SEQ ID NO: 49 

23765 AAATGAATTAAGGTAGTG I I I I CACTAGCAGCCCAACA I I ICI I I CTCTCTTAGTAGTGG 
GTGGGGTATCAGTTATGGAATGGCACCTCCTC 
TTATGCTTCCCTTTATGG^GTAAAGTTTCCATATTTCCATAAAGAA 
ATCCTMTGGATATATMTGAACACACAGATGAAAATTTCACCT 
AAGATCCCTAGCTACrrGTATTTCA^ 

[C,T] 

CTTCAGATCTCCTG I I I I GAAGTGTATATAGATATCAACATAGAAATGCAGCGTATATTG 
CTATCAACTGCAGTGGAGCAGTGATTCGTA&TjriTT 

CTGCAAAATCAAAGTGTGAGCTACGTCTAAACAATGGGAGAGGCI I I I I I I I I I I I I I IA 
AGAGTTAGMCTAAGACTCTCACTTCCTCCTGTGCCTCCACA I I I I I GACCTTCACATTG 
GGCCCCTGXIATCAGAATACAGC^CCCCCTMCAGGXrrCCTGTrCAGGACT 

SEQ ID NO: 50 

24432 GGATGGTGCTGGGGACCTCCCTGACCCACAGCATCTGACCCACATT^ 

GACTTGTGTCAGTAAAGAAAAAGGCACATAGCTMGTGGMGAGCAGATG^ 

GXy\ATCAGCCAGTGGTCTGX;CCTAGCAMGGTAMCAGMCTGCTGGG 

TAGGCTCACTACTCAGGGAGGCACTTTMCATGGAATC^CCAGCAA 

CTTTTCCACCACCZACCACAAGCCTAGTACCTCCCTCCCTCTrrG 

[A,G] 

OWVTGCACTGGAMCCACCTTCAGTTCTGTTTGGAATTTTCCT^ 

GAGGAAGAAGCI I I IGG^TTTACTCCAACCGrrCTACCTATTATrCCCATAMCTTTCTG 
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TGATCTCATATCATTAGGCCAMTGTTMTCT^ 
TTG4GAGGCCCTGGACATATAGGACTGCCTCT 

GAATGCACL I I I I I MCMGTGACTAAAAMCAMCTGTGACTATTCTCTGAAAATGAGC 

SEQ ID NO: 51 

GATGAGGCTTGGTGGGMTCAGCCAGTQGTCTGCCCrAGCAMGGTAMCAGAACT 

GGGGCTTTTGGTCCTAGGCTCACTACTCAG 

GTTTCCTTCCTGATCTTTTCCACCACC^^ 

CTGTTCCTUrCTTCGGGAATG^ 

TTCCTTATTCAGAAAGAGGAAGAAGL I I I IGCATTTACTCCAACCGTTCrACCTATTATT 
[C,G] 

CCATAMCTTTCTGTGATCrCATATCATTAGGCCAMTGTTMTCTTTCT 
AGACTGCTITCACATT^GA(^CCTGGA(^TATAGGACTGCCTCrMCT(^CTCrMCr 
CAGCTTATTGACTTGAATGCACC I I I I I MCMGTGACTAAAAMOWttTGTGAOATT 
CrCTGAAAATGAGCCT ATATCTCATACTTATTTATTCTGTTTMCACrGTGAMCAAATT 
MGTCCTCTGGCACTATGrATATACCATAAAMGCrTATTTGTMGCCTACrMTTGGAC 

SEQ ID NO: 52 

CCTAGTACCTCCCrCCCTCTTTGCTCTGT^^ 

TT<^GTTCTGTTTGGMTTTTCCTATTCaTATT<^GAMGAGGMGAAGLI I I IGCATT 

TACTCCMCCGTTCTACCTATTATTCCCATAMCTTT^ 

AAATGTTMTCTTTCTGGGAGCCAGGAGACTGCT^ 

AGGACTGCCrCTAACT CACTCTMCTCAGCTTATTGACTTGAATGCACC I I I I I AACAAG 
[T,C] 

GACrAAAAAACAAACTGTGACT ATTCTCTGAAMTGAGCCTATATCTCATACTTATTTAT 
TCTGTTTAACACTGTGAMG\MTTMGTCCTCTGGCACTAT 

TTATTTGTAAGCCTACTAATTGGACCAGI I I I GACAATATTGAATAAGCACTAATTGCAG 
ATCATMTGTAGMTTATAGGCTGCTGAGGAAMCMTATCACACCATT^ 
GTTTCCI I I I CAGMTGAGTTTCATAATGTTCACTAATCCAA I I I I I AAAATCCTTTACA 

SEQ ID NO: 53 

AACCGTTCTACCTATTATTCCCAT^ 

TMTCTTTCTGGGAGCOVGGAGACTGCTTTCACATTCAGAGGCC^ 
GCCTCTMCTCACTCTAACTCAGCTTATTGACTTGMTGCACC I I I I I AACAAGTGACTA 
AAAAACAAACTGTGACTATTCTCTGAAMTGAGCCTATATCTCATACTT ATTTATTCTGT 
TTAACACTGTGAAACAAATTAAGTCCTCTGGC^CTATGTATATACCATAAAAAGCTTATT 
[C,T] 

GTAAGCCTACTAATTQGACCACj I I I I GACAATATTGAATAAGCACTAATTGCAGATCATA 
ATGTAGAATTATAGGCTGCrGAGGAAAACMTATC^C^CCATTTGCTTT CCTCAGTTTCC 
TTTTCAGAATGAGTTTCATMTGTTCACTAATCCAA I I I I I AAAATCCTTT ACAAAGTTA 
TTCTTAMCTATTTCCAGAGACTATCTGGTTTGTC^TTCTAGAMT I I I 1 C 

AGCCTAAACAGATGGCCTTAA I I I I I GGTGGAGTGGTATGAAAGGAATGTCACATGAGAA 

SEQ ID NO: 54 
TATCCAGTTACAGCAGCGTMCTTGAGCAGCTGCTGCAMCTGAGG 
CGCCTACTTATTTCAGCTGCTAAMTAGGGCT^ 

GATMGATTCCrACTATTCAATTTAATTTAAG(_l I I I ATTCAGTGCCTGCTGTGTGCACA 
ACACTMGCTAGAAAGTCTGAGGMTGTTTAGATTATTAGGTCCTG^ 
TAGATTTACAATCTATTGATAGGGAGAGCTAAAMGGAGAGAMGAGGMGGAGCAAACA 
[C,T] 

AAAMCGTCAAMTTTTAAMTACCATTTTAA^ 

GCAAAATTAAGGAAAACCT AGATTCATAAAMTTCCTITCAC^ATCTTGTGTAAATG\AT 
TCAGTGCTTGCCCTTAATGTCTCATCCAGTCT GATGAGACATGTTTTGT GATCAACAAGG 




GUI I ACTATG I MCI lAATTATGTGTCTTGCCTGTTATCTaTTCTGACCGAGATTATT 
TTTAACMTAMTTCTGAAMCTMGAMGTGAMGCATAAMTATTCT 

SEQ ID NO: 55 
AAAAACGTCAAAATTTTAAMTACCATTTTAAMTTTTATTTTA 
GCAAMTTMGGAAAACCTAGATTCATAAAMTTCCTTTCACMTCTTGTGTAM^ 
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TCACTGCTTGCCCrTAATGTCT(^TCCACTCrGATG^GA(^TGrrTTGTGATGV\CAA< 



GTTTTACTATG I MCI I AATTATGTGTCTTGCCrGTTATCrCTTTCTGACCGAGATTATT 
TTTMCMTAMTTCTGAAMCTMGAMGTGAAAG^^ 
[G,C] 

GCCMGGAAAAMTGACACTCCATTTC^MTATCAAMGrTAGCATCAAGACT 
ATGMTGTACAGTCATGTGTTGCTTACAMTGTGGAO^TATTCTGAGAMTGCA 
GGCMTTTTGTCATTGTGCAMCACCATAGATTGrACTTGCAGCCTM 
ACTATACACTMGGCTATATGGCATAGCCTAGTACTCCTAGGCrACAMCCT 
TGTTACTGTACTGAATAGrGG^GGrACCTGTAACATAATGGTAAGTATTTGTGTCTCCAA 

SEQ ID NO: 56 

26555 AGTACTCCTAGGCTACAMCCTGTACAGCATGTTACrGTACTGMTAGTGC^ 

TMGATAATGGTAAGTATTTGTGTCTCCAMCGTAGAAMGCrACTGTAAAMTA 

TTACMCCTTAGGGTATCACTGTCTTATATGTGGrCTG I IGI I GACCGAAATGACTATGC 

TTMTACCACTGAACTGTA(^ClTAAAMTGGrrAAGATGGrAMTTCTATGT^ 

GTTTTATMTMTAAAAAMTTGAAAAMGCATCAACATCTTTTCTQGG^ 

[G,A] 

GAMGAAAATGCATTAGAGTGATGAGMTATTTGAAGTAATAGATAMGTCAAA^ 
GAAATGATCTTGCCrrTGAAC I I ICI IGI I I AAGATTCGTACATCAGTGATCACACTGTT 
ATTTCCCAMCGACCCTTCAGCrGGATACGACATTTCCTGATTGC^ 
ACTTMTMTGTTCTGGTCATCCTTGT^ 
TGAGTTTCAGAMGGCTTCMTTTGGTCAACCCAMCTCACG^ 

SEQ ID NO: 57 

27886 GGTTTATTTAAACTGTCTGCTGGCATCTC 

CCTTGCCCTGTGTTTGTCTTCTCAGGGGL I ICI I CTGCCACTATGCTGA I I I I IATTCTT 
CCAGCAGI I I I I I ATCrTAAACTTGTCAAGAAAGAAAC I I I I AGGTCACCCCAAAAGGTC 
GGGGTMGTAMCCTTGCAATTTCCCCCATTATTAG I IGI ILI I CCAACTACTTAGAATA 
MCTAGAAMTACACATAGTTCAGAAAMTGMTCMT^ 
[A,C] 

TGGGCTAGMCrrrCTGGTAGCAGAGAMGGGGACATAT^ 
ACTTCAMTATCAMTATCCTGTGTTGAGTCTGTCATA<^^ 
TCCCACAGACAG\TATGCrrCAGGCAAATAGCAGTGTCCAATACCAAGCT 
TATCCGTGGAAMTCATGCMGAAGGAATTAGGCTCCCTAGCGGTGT^^ 
AAATATTTTGGTCATGG I I G I I AGGTTTGCAAAGCCAAAGGAAAGATGTTGL I I I I G I I I 

SEQ ID NO: 58 

31884 CI I I I ATGGTTAGTTTGAAAGAATCCATTG7V\G^TAGAAAATGAGAGAATAGAAGAAACC 
TGAGAATAGTAAAATAAAGAGG^GAGAAAATATGGGGGCAGGG7WV\CATGTGACT 
AGGATTGATTATGAATGAACGATTAGGGGGATTGATGGATCACAGGGTMGTATATG 
AACTTTATAAGAAACTTCCACATAGI I I I CCACAGTGTTTCTACCATTTTCATTTCCACC 
CGTACTACCTACMCTTCCACTGACTCCACAGCCCTGCCMCATT^ 
[T,C] 

ATTTTAGCCTTTCT AGTGGGTCTGAMTGGTAACTCATTGTGATTTTCAT^ 
GTGACAACTAATGTTGAAAAC I I I I CAAGTGTTTAATGGTCACTCATATATC I ICI I I IG 
TGAAGTGTGTATTCAAATC I I I IGCCCAI I I I IAAAATTTAGGTTATGTGI I I I IATTGG 
GTATTTGTAGAAGXTCTTTAMTATGX^ 
TCTATGGTATGGTTGCTTATTTTCCTAMGGTGTCTTM 

SEQ ID NO: 59 

32229 TTTCATTrCTGCTTCTGTGACMCTMTGTTGVV\AAC I I I I CAAGTGTTTAATGGTCACT 
CATATATCI ICI I I I GTGAAGTGTGTATTCAMTCTTTTGCCCA I I I I IAAAATTTAGGT 
TATGTG I 1 1 I I ATTGGGTATTTGTAGAAGCrCrrTAMTATGGATCCATGTCCAGATTGC 
CMTATATTTTCCCAGTCrATGGTATGGTTGCrrATTTTCCTAMGGTCT 
TCTTTCTGGGXKiCAGGTCIACCATAGCTCAMGlTTTGO 
[T,A] 

ATTMTCAG7\GTGXrrATAGTCAAAATTAAATG I I I I GATGTCCTGGGCCCATATAGGTAG 
GACTGGATCATCTMCCAAGATGCAAAAAAAAAAAMCAA^ 
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GAAAAACTTATTTTAAATTAAACA 



SEQ ID NO: 60 



TO 

<5^ 
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